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This last velmne of the Berkeley Physics Cunrse is devoted to the stvdy οἱ 
large-scale (i.e, sderoscopic} systems consisting of nany aiores ormucle- 
thus it provides au introduction to the subjects of statistical me- 
thermedynamics, and h The approach which 


subjects and does not proceed along conventivnal iines 
rather to adept a medern point of view and to show, in as 
simple uw way as possible, how the basic notions of atomic theory lead to a 
coherent conceptual framework capable of describing and predicting the 
properties af macroscopic systems. 

In writing this book I have tried to keep in mind « student who, 
unencumbered by any prior familiarity with the subject matter, i: encour 
tering it fov the first time from the vantage point of his previous study of 
elementary physics and atomic properties. 1 have therefore chosen an 
order of presentation which might seem well-motiveted to such ἡ student if 
he attempted to discover by himself how to gain an understanding of 
macrescopic systems. In seeking to make the presentation coherent and 
unifed, i have based the entire discussion upon the systematic elaborati 
of a single principle, the tendency of an isolated! system to approact ts 
situation of gseutest randomness. Although I have rest™ ᾿ 
to simple systems, T have treated'them by methods which are widely upp 
generalized. Above all. 1 have tried throu gho 
physital insight, the ability to perceive significant relationships 
simply Thus I have attempted to discuss physical ideas ai len 
vetting lost in mathematical formalism, to provide simple exampies to iles- 
trote abstract general concepts, to make numerical estin-etes of sigm*ficaut 
quantities, and to-relate the theory to the real world of observation ant 


the most furdamenta! con- 

pis which wentd be aseful to physicists as well as to students of chemistry, 
luology, or engineering. The Teaching aud Study Notes summarize the 
organization and contents of the book and provide some guides for the 


tion, designe:i τὸ stress the relation betw 
levels of deseription, does not necessacily sacrifice the virtzes inherent in 
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more traditional approaches. In particular, the following features may be 
worth mentioning: 

(i) The student who completes Chap. 7 (even if he omits Chap. 6) 
will know the fundamental principles and basic applications of classical 
thermodynamics as well as if he had studied the subject along traditional 
lines. Of course, he will also have acquired much more insight into 
the meaning of entropy and a considerable knowledge of statistical physics. 

(ii) I have been careful to emphasize that the statistical theory leads 
to certain results which are purely macroscopic in content and completely 
independent of whatever models one might assume about the atomic 
structure of the systems under consideration. The generality and model- 
independence of the classical thermodynamic Jaws is thus made explicitly 
apparent. 

(iii) Although a historical approach rarely provides the most logical 
or illuminating introduction to a subject, an acquaintance with the evolution 
of scientific ideas is both interesting and instructive. I have, therefore, 
included in the text some pertinent remarks, references, and photographs 
of prominent scientists, all designed to give the student some perspective 
concerning the historical development of the subject. 

The prerequisites needed for a study of this volume include, besides 
a rudimentary knowledge of classical mechanics and electromagnetism, 
only an acquaintance with the simplest atomic concepts and with the 
following quantum ideas in their most unsophisticated form: the meaning 
of quantum states and energy levels, Heisenberg’s uncertainty principle, 
the de Broglie wavelength, the notion of spin, and the problem of a 
free particle in a box. The mathematical tools required do not go beyond 
simple derivatives and integrals, plus an acquaintance with Taylor's series. 
Any student familiar with the essential topics covered in the preceding 
volumes of the Berkeley Physics Course (particularly in Vol. TV) would, of 
course, be amply prepared for the present book. The book could be used 
equally well, however, for the last part of any other modern introductory 
physics course or for any comparable course at, or above, the level of 
second-year college students. 

‘As I indicated at the beginning of this preface, my aim has been to 
penetrate the essence of a sophisticated subject sufficiently to make it seem 
simple, coherent, and easily accessible to beginning students. Although 
the goal is worth pursuing. it is difficult to attain. Indeed, the writing of 
this book was for me an arduous and lonely task that consumed an 
incredible amount of time and left me feeling exhausted. It would be 
some slight compensation to know that I had achieved my aim sufficiently 
well so that the book would be found useful. 
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Organization of the Book 
The book is divided into three main parts which I shall describe in turn: 


Part A: Preliminary Notions (Chapters 1 and 2) 


Chapter 1: _ This chapter provides a qualitative introduction to the most 
fundamental physical concepts to be explored in this book. It is designed to 
make the student aware of the characteristic features of macroscopic sys- 
tems and to orient his thinking along fruitful lines, 


Chapter 3: This chapter is somewhat more mathematical in nature and is 
intended to familiarize the student with the basic notions of probability 
theory. No prior knowledge of probability ideas is assumed. The concept 
of ensembles is stressed throughout and all examples are designed to illu- 
minate physically significant situations. Although this chapter is oriented 
toward subsequent applications in the remainder of the book, the proba- 
bility concepts discussed are, of course, intended to be useful to the student 
in far wider contexts. 


These chapters should not take too much time. Indeed, some students 
may Well have sufficient background preparation to be familiar with some 
of the material in these chapters. Nevertheless, I would definitely rec- 
ommend that such students not skip these two chapters, but that they con- 
sider them a useful review. 


Part B: Basic Theory (Chapters 3, 4, and 5) 
This part constitutes the heart of the book. The logical and quantitative 
development of the subject of this volume really starts with Chapter 3. 
(In this sense the first two chapters could have been omitted, but this 
would have been very unwise pedagogically.) 


Chapter 3: This chapter discusses how a system consisting of many 
particles is described in statistical terms. It also introduces the basic 
postulates of the statistical theory. By the end of this chapter the student 
should have come to realize that the quantitative understanding of macro- 
scopic systems hinges essentially on considerations involving the counting 
of the states accessible to the systems. He may not yet perceive, however, 
that this insight has much useful value. 


xii Teaching and Study Notes 


Chapter 4: This chapter constitutes the real pay-off. It starts out, inno- 
cently enough, by investigating how two systems interact by heat transfer 
alone. This investigation leads very quickly, however, to the fundamental 
concepts of entropy, of absolute temperature, and of the canonical distri- 
bution (or Boltzmann factor). By the end of the chapter the student is in 
a position to deal with thoroughly practical problems—indeed, he has 
learned how to calculate from first principles the paramagnetic properties 
of a substance or the pressure of an ideal gas. > 


Chapter 5: This chapter brings the ideas of the theory completely down 
to earth. Thus it discusses how one relates atomic concepts to macroscopic 
measurements and how one determines experimentally such quantities as 
absolute temperature or entropy. 


An instructor thoroughly pressed for time can stop at the end of these 
five chapters without too many pangs of conscience. At this point a stu- 
dent should have acquired a fairly good understanding of absolute temper- 
ature, entropy, and the Boltzmann factor—i.e., of the most fundamental 
concepts of statistical mechanics and thermodynamics. (Indeed, the only 
thermodynamic result still missing concerns the fact that the entropy 
remains unchanged in a quasi-static adiabatic process.) At this stage I 
would consider the minimum aims of the course to have been fulfilled. 


Part C: Elaboration of the Theory (Chapters 6, 7, and 8) 


This part consists of three chapters which are independent of each other in 
the sense that any one of them can be taken up without requiring the others 
as prerequisites. In addition, it is perfectly possible to cover only the first 
few sections of any one of these chapters before turning to another of 
these chapters. Any instructor can thus use this flexibility to suit his own 
predilections or the interests of his students. Of the three chapters, Ch. 7 
is the one of greatest fundamental importance in rounding out the theory; 
since it completes the discussion of thermodynamic principles, it is also the 
one likely to be most useful to students of chemistry or biology. 


Chapter 6: This chapter discusses some particularly important applica- 
tions of the canonical distribution by introducing approximate classical 
notions into the statistical description. The Maxwell velocity distribution 
of molecules in a gas and the equipartition theorem are the main topics of 
this chapter. Illustrative applications include molecular beams, isotope 
separation, and the specific heat of solids. 


Chapter 7: This chapter begins by showing that the entropy remains un- 
changed in a process which is adiabatic and quasi-static. This completes 
the discussion of the thermodynamic laws which are then summarized in 
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their full generality. The chapter then proceeds to examine a few impor- 
tant applications: general equilibrium conditions, including properties of 
the Gibbs free energy; equilibrium between phases; and implications for 
heat engines and biological organisms. 


Chapter 8: This last chapter is intended to illustrate the discussion of the 
nonequilibrium properties of a system. It treats the transport properties of 
a dilute gas by the simplest mean-free-path arguments and deals with 
viscosity, thermal conductivity, self-diffusion, and electrical conductivity. 


This completes the description of the essential organization of the book. 
In the course as taught at Berkeley, the aim is to cover the major part of this 
book in about eight weeks of the last quarter of the introductory physics 
sequence. 

The preceding outline should make clear that, although the presentation 
of the subject matter of the book is unconventional, it is characterized by 
a tight logical structure of its own. This logical development may well 
seem more natural and straightforward to the student, who approaches the 
topics without any preconceptions, than to the instructor whose mind is 
molded by conventional ways of teaching the subject. I would advise the 
instructor to think the subject through afresh himself. If sheer force of 
habit should lead him to inject traditional points of view injudiciously, he 
may disrupt the logical development of the book and thus confuse, rather 
than enlighten, the student. 


Other Features of the Book 


Appendix: The four sections of the appendix contain some peripheral 
material. In particular, the Gaussian and Poisson distributions are specifi- 
cally discussed because they are important in so many diverse fields and 
because they are also relevant in the laboratory part of the Berkeley Physics 
Course. 


Mathematical Notes: These notes constitute merely a collection of 
mathematical tidbits found useful somewhere in the text or in some of the 
problems. 


Mathematical Symbols and Numerical Constants: These can be found 
listed at the end of the book and also on its inside front and back covers. 


Summaries of Definitions: These are given at the ends of chapters for 
ease of reference and convenience of review. 


Problems: The problems constitute a very important part of the book. I 
have included about 160 of them to provide an ample and thought-provok- 
ing collection to choose from. Although I would not expect a student to 
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work through all of them, I would encourage him to solve an appreciable 
fraction of the problems at the end of any chapter he has studied; other- 
wise he is likely to derive little benefit from the book. Problems marked 
by stars are somewhat more difficult. The supplementary problems deal 
mostly with material discussed in the appendices. 


Answers to Problems: Answers to most of the problems are listed at the 
end of the book. The availability of these answers should facilitate the use 
of the book for self-study. Furthermore, although I would recommend 
that a student try to solve each problem before looking at the answer given 
for it, I believe that it is pedagogically valuable if he can check his own 
answer immediately after he has worked out a solution. In this way he can 
become aware of his mistakes early and thus may be stimulated to do 
further thinking instead of being lulled into unjustified complacenéy. (Al- 
though I have tried to assure that the answers listed in the book are 
correct, I cannot guarantee it. I should appreciate being informed of any 
mistakes that might be uncovered.) 


Subsidiary material: Material which consists of illustrations or various re- 
marks is set in two-column format with smaller type in order to differentiate 
it from the main skeleton of logical development. Such subsidiary mate- 
rial should not be skipped in a first reading, but might be passed over in 
subsequent reviews. 


Equation numbering: Equations are numbered consecutively within 
each chapter. A simple number, such as (8), refers to equation number 8 
in the chapter under consideration. A double number is used to refer to 
equations in other chapters. Thus (3.8) refers to Equation (8) in Chap. 3, 
(A.8) to Equation (8) in the Appendix, (M.8) to Equation (8) in the Mathe- 
matical Notes. 


Advice to the Student 


Learning is an active process. Simply reading or memorizing accomplishes 
practically nothing. Treat the subject matter of the book as though you 
were trying to discover it yourself, using the text merely as a guide that 
you should leave behind. The task of science is to learn ways of thinking 
which are effective in describing and predicting the behavior of the 
observed world. The only method of learning | new ways of thinking is to 
practice thinking. Try to strive for insight, to find new relationships and 
simplicity where before you saw none. Above all, do not simply memorize 
formulas; learn modes of reasoning. The only relations worth remembering 
deliberately are the few Important Relations listed explicitly at the end of 
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each chapter. 1 these are not sufficient to allow you to reconstruct in your 
head any other significant formula in about twenty seconds or less, you have 
not understood the subject matter. 

Finally, it is much more important to master a few fundamental con- 
cepts than to acquire a vast store of miscellaneous facts and formulas. If 
in the text I have seemed to belabor excessively some simple examples, such 
as the system of spins or the ideal gas, this has been deliberate. It is partic- 
ularly true in the study of statistical physics and thermodynamics that 
some apparently innocent statements are found to lead to remarkable con- 
clusions of unexpected generality. Conversely, it is also found that many 
problems can easily lead one into conceptual paradoxes or seemingly hope- 
less calculational tasks; here again, a consideration of simple examples can 
often resolve the conceptual difficulties and suggest new calculational pro- 
cedures or approximations. Hence my last advice is that you try to under- 
stand simple basic ideas well and that you then proceed to work many prob- 
lems, both those given in the book and those resulting from questions you 
may pose yourself. Only in this way will you test your understanding and 
learn how to become an independent thinker in your own right. 
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Characteristic Features of Macroscopic Systems 


Dass ich erkenne, was die Welt 

tm Innersten zusammenhait, 

Schau’ aile Wirkenskraft und Semen, 

Und τυ" nicht mehr in Worten kramen. 
Goethe, Faust} 


The entire world of which we are aware through our senses consists 
of objects that are macroscopic, i.e., large compared to atomic dimen- 
sions and thus consisting of very many atoms or molecules. This world 
is enormously varied and complex, encompassing gases, liquids, solids, 
and biological organisms of the most diverse forms and compositions. 
Accordingly, its study has formed the subject matter of physics, chem- 
istry, biology, and several other disciplines. In this book we want to 
undertake the challenging task of gaining some insights into the fun- 
damental properties of all macroscopic systems. In particular, we 
should like to investigate how the few unifying concepts of atomic 
theory can lead to an understanding of the observed behavior of mac- 
roscopic systems, how quantities describing the directly measurable 
properties of such systems are interrelated, and how these quantities 
can be deduced from a knowledge of atomic characteristics. 
Scientific progress made in the first half of this century has led to 
very basic knowledge about the structure of matter on the micro- 
scopic level, i.e., on the small scale of the order of atomic size 
(10-* cm). Atomic theory has been developed in quantitative detail 
and has been buttressed by an overwhelming amount of experimental 
evidence. Thus we know that all matter consists of molecules built up 
of atoms which, in turn, consist of nuclei and electrons. We also know 
the quantum laws of microscopic physics governing the behavior of 
atomic particles. Hence we should be in a good position to exploit 
these insights in discussing the properties of macroscopic objects. 
Indeed, let us justify this hope in greater detail. Any macroscopic 
system consists of very many atoms. The laws of quantum mechanics 
describing the dynamical behavior of atomic particles are well estab- 
lished. The electromagnetic forces responsible for the interactions 
between these atomic particles are also very well understood. 


+ From Faust’s opening soliloquy in Goethe's play, Part I, Act I, Scene 1, lines 382-385, 
‘Translation: “That I may recognize what holds the world together in its inmost essence, 
behold the driving force and source of everything, and rummage no more in empty words.” 
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Ordinarily they are the only forces relevant because gravitational forces 
between atomic particles are usually negligibly small compared to elec- 
tromagnetic forces. In addition, a knowledge of nuclear forces is usu- 
ally not necessary since the atomic nuclei do not get disrupted in most 
ordinary macroscopic physical systems and in all chemical and biologi- 
cal systems.} Hence we can conclude that our knowledge of the laws 
of microscopic physics should be quite adequate to allow us, in prin- 
ciple, to deduce the properties of any macroscopic system from a 
knowledge of its microscopic constituents. 

It would, however, be quite misleading to stop on this optimistic 
note. A typical macroscopic system of the type encountered in every- 
day life contains about 1055 interacting atoms. Our concrete scientific 
aim is that of understanding and predicting the properties of such a 
system on the basis of a minimum number of fundamental concepts. 
We may well know that the laws of quantum mechanics and electro- 
imagnetism describe completely all the atoms in the system, whether 
it be a solid, a liquid, or a human being. But this knowledge is utterly 
useless in achieving our scientific aim of prediction unless we have 


tems. The difficulties involved are not of a type which can 
ved merely by the brute force application of bigger and better 
electronic computers. The problem of 1025 interacting particles 
dwarfs the capabilities of even the most fanciful of future computers; 
furthermore, unless one asks the[right questions, reams of computer 
output tape are likely to provide no insight whatever into the essential 
features of a problem. It is also worth emphasizing that complexity 
involves much more than questions of quantitative detail. In many 
cases it can lead to remarkable qualitative features that may seem quite 
unexpected. For instance, consider a gas of identical simple atoms 
(eg., helium atoms) which interact with each other through simple 
known forces. It is by no means evident from this microscopic infor- 
mation that such a gas can condense very abruptly so as to form a 
liquid. Yet this is precisely what happens. An even more striking 
illustration is provided by any biological organism. Starting solely 
from a knowledge of atomic structure, would one suspect that a few 
simple kinds of atoms, forming certain types of molecules, can give 
rise to systems capable of biological growth and self-reproduction? 
The understanding of macroscopic systems consisting of very many 
particles thus requires primarily the formulation of new concepts ca- 
pable of dealing with complexity. These οἱ concepts, based ultimately 


{Gravitational and nuclear interactions may, however, become pertinent in some astro- 
physical problems. 
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4 Characteristic Features of Macroscopic Systems 


upon the known fundamental laws of microscopic physics, should 
achieve the following aims: make apparent the parameters most useful 
in describing macroscopic systems; permit us to discern readily the 
essential characteristics and regularities exhibited by such systems; and 
finally, provide us with relatively simple methods capable of predicting 
quantitatively the properties of these systems. 

The discovery of concepts sufficiently powerful to achieve these 
aims represents clearly ἃ major intellectual challenge, even when the 
fundamental laws of microscopic physics are assumed known. It is 
thus not surprising that the study of complex systems consisting of 
many atoms occupies much attention in research at the forefront of 
physics. On the other hand, it is remarkable that quite simple reason- 
ing is sufficient to lead to substantial progress in the understanding of 
macroscopic systems. As we shall see, the basic reason is that the very 
presence of a large number of particles allows one to use statistical 
methods with singular effectiveness, 

It is by no means obvious how we should go about achieving our 
aim of understanding macroscopic systems. Indeed, their apparent 
complexity may seem forbidding. In setting out on our path of dis- 
covery we shall, therefore, follow good scientific procedure by first 
examining some simple examples. At this stage we shall not let our 
imagination be stifled by trying to be rigorous or overly critical. Our 
pee this chapter is rather to recognize the essential features 
characteristic of macroscopic systems, to see the main problems in 
qualitative outline, and to get some feeling for typical magnitudes. 
This preliminary investigation should serve to suggest to us appropri- 
ate methods for attacking the problems of macroscopic systems in a 
systematic and quantitative way. 


1.1 Fluctuations in Equilibrium 

A simple example of a system consisting of many particles is a gas of 
identical molecules, e.g., argon (Ar) or nitrogen (Nz) molecules. If the 
gas is dilute (ie, if the number of molecules per unit volume is small), 
the average separation between the molecules is large and their mutual 
interaction is correspondingly small. The gas is said to be ideal if it 
is sufficiently dilute so that the interaction between its molecules is 
almost negligible.} An ideal gas is thus particularly simple. Each of 
its molecules spends most of its time moving like a free particle unin- 


{ The interaction is “almost” negligible if the total potential energy of interaction be- 
tween the molecules is negligible compared to their total kinetic energy, but is sufficiently 
large so that the molecules can interact and thus exchange energy with each other. 
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fluenced by the presence of the other molecules or the container walls; 
only rarely does it come sufficiently close to the other molecules or the 
container walls so as to interact (or collide) with them. In addition, if 
the gas is sufficiently dilute, the average separation between its mole- 
cules is much larger than the average de Broglie wavelength of a 
molecule. In this case quantum-mechanical effects are of negligible 
importance and it is permissible to treat the molecules as distinguish- 
able particles moving along classical trajectories. + ὝΨΗ 

Consider then an ideal gas of N molecules confined within a con- 
tainer or box. In order to discuss the simplest possible situation, sup- 
pose that this whole system is isolated (ie., that it does not interact 
with any other system) and that it has been left undisturbed for a very 
long time. We now imagine that we can observe the gas molecules, 
without affecting their motion, by using a suitable camera to take a 
motion picture of the gas. Successive frames of the film would show 
the positions of the molecules at regular intervals separated by some 
short time το. We then could examine the frames individually, or 
alternatively, could run the movie through a projector. 

In the latter case we would observe on the projection screen a pic- 
ture showing the gas molecules in constant motion: Thus any given 
molecule moves along a straight line until it collides with some other 
molecule or with the walls of the box; it then continues moving along 
some other straight line until it collides again; and so on and so forth. 
Each molecule moves strictly in accordance with the laws of motion 
of mechanics. Nevertheless, N molecules moving throughout the box 
and colliding with each other represent a situation so complex that the 
picture on the screen appears rather chaotic (unless N is very small). 

Let us now focus attention on the positions of the molecules and 
their distribution in space. To be precise, consider the box to be di- 
vided by some imaginary partition into two equal parts (see Fig. 1.1). 
Denote the number of molecules in the left half of the box by n and 
the number in the right half by n’. Of course 


n+n’=N, (1) 


the total number of molecules in the box. If N is large, we would 
ordinarily find that n = n’, i.e., that roughly half of the molecules are 
in each half of the box. We emphasize, however, that this statement 
is only approximately true. Thus, as the molecules move throughout 
the box, colliding occasionally with each other or with the walls, some 


{The validity of the classical approximation will be examined more extensively in 
Sec. 6.3, 


nt+n=N 


Fig. 1.1 A box containing an ideal gas con- 
sisting of N molecules. The box is shown sub- 
divided into two equal parts by an imaginary 
partition. The number of molecules in the 
left half is denoted by n, the number of mole- 
cules in the right half by n’. 


@ 
Φ 


᾿ 
i 
Ρ 
k 


Fig. 1.2 Schematic diagram illustrating the 
4 different ways in which 2 molecules can be 
distributed between the two halves of a box. 
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of them enter the left half of the box, while others leave it. Hence the 
number n of molecules actually located in the left half fluctuates con- 
stantly in time (see Figs. 1.3 through 1.6). Ordinarily these fuctua- 
tions are small enough so that n does not differ too much from 4N. 
There is, however, nothing that prevents all molecules from being 
in the left half of the box (so that n = N, while n’ = 0). Indeed this 
might happen. But how likely is it that it actually does happen? 

To gain some insight into this question, let us aski in how many ways 
the molecules can be distributed between the two halves of the box. 
We shall call each distinct way way in which the molecules can be distrib- 
uted between these two halves a configuration. A single molecule can 
then be found in the box in two possible configurations, i.e., it can be 
either in the left half or in the right half. Since the two halves have 
equal volumes and are otherwise equivalent, the molecule is equally 
likely to be found in either half of the box.+ If we consider 2 molecules, 
each one of them can be found in either of the 2 halves. Hence the 
total number of possible configurations (i.e., the total number of pos- 
sible ways in which the 2 molecules can be distributed between the 
two halves) is equal to 2 x 2 = 32 = 4 since there are, for each pos- 
sible configuration of the first molecule, 2 possible configurations of 
the other (see Fig. 1.2). If we consider 3 molecules, the total number 
of their possible configurations is equal to 2 x 2 x 2 = 2% = 8 since 
there are, for each of the 32 possible configurations of the first 2 mole- 
cules, 2 possible configurations of the last one. Similarly, if we con- 
sider the general case of N molecules, the total number of possible 
configurations is 2x 2x --- X 2= 2%, These configurations are 
listed explicitly in Table 1.1 for the special case where N = 4. 

Note that there is only one way of distributing the N molecules so 
that all N of them are in the left half of the box. It represents only 
one special configuration of the molecules compared to the 2" possible 
configurations of these molecules. Hence we would expect that, 
among a very large number of frames of the film, on the average only 
one out of every 2% frames would show all the molecules to be in the 
left half. If Py denotes the fraction of frames showing all the N mole- 
cules located in the left half of the box, i.e., if Py denotes the relative 
frequency, or probability. of finding all the N molecules in the left 
half, then 


Py = (2) 


oN 

# We assume that the likelihood of finding a particular molecule in any half of the box 
is unaffected by the presence there of any number of other molecules. This will be true 
if the total volume occupied by the molecules themselves is negligibly small compared to 
the volume of the box. 
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Computer-constructed pictures 


The following pages and several subse- 
quent ones show figures constructed by 
means of a high-speed electronic digital 
computer. The situation investigated in 
every case is the classica! motion of sev- 
eral particles in a box, the particles being 
represented by disks moving in two di- 
mensions. The forces between any two 
particles, or between a particle and a 
wall, are assumed to be like those be- 
tween “‘hard’' objects (i.e., to vanish 
when they do not touch and to become 
infinite when they do touch). All result- 
ing collisions are thus elastic. The com: 
puter is given some initial specified posi- 
tions and velocities of the particles. It 
is then asked to solve numerically the 
equations of motion of these particles 
for all subsequent (or prior) times and 


to display pictorially on a cathode-ray 
oscilloscope the positions of the moie- 
cules at successive times t = jz» where 
τὸ is some small fixed time interval and 
where j= 0, 1, 2, 3,.... A movie 
camera photographing the oscilloscope 
screen then yields the successive picture 
frames reproduced in the figures. (The 
time interval τὸ was chosen long enough 
so that several molecular collisions occur 
between the successive frames displayed 
in the figures.) The computer is thus 
used to simulate in detail a hypothetical 
experiment involving the dynamical in- 
teraction between many particles. 

All the computer-made pictures were 
produced with the generous cooperation 
of Dr. B. J, Alder of the Lawrence Radia- 
tion Laboratory at Livermore. 
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located in each half of the box is printed directly beneath that 
half. The short line segment emanating from each particle indi- 
cates the direction of the particle’s velocity. 


Fig. 1.3 Computer-made pictures showing 4 particles in a box. 
The fifteen successive frames (labeled by j = 0, 1, 2,..., 14) 
are pictures taken a long time after the beginning of the compu- 
tation with assumed initial conditions. The number of particles 
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Fig. 1.4. Computer-made pictures showing 40 particles ina box. _ tation with assumed initial conditions. The number of particles 
The fifteen successive frames (labeled by. j = 0, 1, 2,..., 14) — located in each half of the box is printed directly beneath that 
are pictures taken a long time after the beginning of the compu- half. The velocities of the particles are not indicated. 
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Fig. 1.5 The number n of particles in the left 
half of the box as a function of the frame index 
j or the elapsed time t = jro. The number πὶ 
in the jth frame is indicated by a horizontal 
line extending from ἢ to j + 1. The graphs 
describe Fig. 1.3 for N = 4 particles and Fig. 
1.4 for N = 40 particles, but contain informa- 
tion about more frames than were shown there. 


Fig. 1.6 The relative number n/N of particles 
in the left half of the box as a function of the 
frame index j or the elapsed time t = ἦτο. The 
information presented is otherwise the same 
as that in Fig, 1.5. 
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Similarly, the case where no molecule at all is in the left half is also 
very special since there is again only one such configuration of the 
molecules out of 2% possible configurations. Thus the probability Po 
of finding no molecule located in the left half should also be given by 


Rad. 3) 


More generally, consider a situation where n of the N molecules of 
the gas are located in the left half of the box and let us denote by 
C(n) the number of possible configurations of the molecules in this 
case. [That is, C(n) is the number of possible ways the molecules can 
be distributed in the box so that n of them are found in the left half 
of the box.] Since the total number of possible configurations of the 
molecules is 2%, one would expect that, among a very large number of 


frames of the film, on the average C(n) out of every 2% such frames 


would show n molecules to be in the left half of the box. If P, denotes 
the fraction of frames showing n molecules located in the left half, i-e., 
if P,, denotes the relative frequency, or probability, of finding n mole- 
cules in the left half, then 


— Gln) 


P= (4) 


Example 


Consider the special case where the gas respondingly n’ = N—n molecules in 
consists of only four molecules. The the right half) is given by: 
number C(n) of possible configurations 


of each kind is listed in Table 1.1. Sup- Ps = Po = ἐξ, 

pose that a movie of this gas consists of 

8 great many frames. Then we expect P3 =P, = Ξ ἢ. (4a) 
that the fraction P, of these frames show- 

ing n molecules in the left half (and cor- Ῥχ τ ἧς τ ἢ. 


As we have seen, a situation where n = N (or where n = 0) corre- 
sponds only to a single possible molecular configuration. More gen- 
erally, if N is large, then C(n) <2" if n is even moderately close to N 
(or even moderately close to 0). In other words, a situation where the 
distribution of molecules is so nonuniform that n>>4N (or that 
n<4N) corresponds to relatively few configurations. A situation of 
this kind, which can be obtained in relatively few ways, is rather spe- 
cial and is accordingly said to be relatively nonrandom or orderly; 
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according to (4), it occurs relatively infrequently. On the other hand, 
a situation where the distribution of the molecules is almost uniform, 
so that n = π΄, corresponds to many possible configurations; indeed, as 
is illustrated in Table 1.1, C(n) is maximum if n = π' = ἐν. A situa- 
tion of this kind, which can be obtained in many different ways, is said 
to be random or disordered; according to (4) it occurs quite frequently. 
In short, more random (or uniform) distributions of the molecules in 
the gas occur more frequently than less random ones. The physical 
reason is clear: All molecules must move in a very special way if they 
are to concentrate themselves predominantly in one part of the box; 
similarly, if they are all located in one part of the box, they must move 
in a very special way if they are to remain concentrated there. 

The preceding statements can be made quantitative by using Eq. (4) 
to calculate the actual probability of occurrence of a situation where 
any number n of molecules are in the left half of the box. We shall 
postpone until the next chapter the requisite computation of the num- 
ber of molecular configurations C(n) in the general case, It is, how- 
ever, very easy and illuminating to consider an extreme case and ask 
how frequently one would expect all the molecules to be located in 
the left half of the box. Indeed, (2) asserts that a fluctuation of this 
kind would, on the average, be observed to occur only once in every 
2% frames of the film. 

To gain an appreciation for magnitudes, we consider some specific 
examples. If the gas consisted of only 4 molecules, all of them would, 
on the average, be found in the left half of the box once in every 16 
frames of the film. A fluctuation of this kind would thus occur with 
moderate frequency. On the other hand, if the gas consisted of 80 
molecules, all of these would, on the average, be found in the left half 
of the box in only one out of 28° = 104 frames of the film. This 
means that, even if we took a million pictures every second, we would 
have to run the film for a time appreciably greater than the age of the 
universe before we would have a reasonable chance of obtaining one 
frame showing all the molecules in the left half of the box. Finally, 
suppose that we consider as a realistic example a box having a volume 
of 1 cm3 and containing air at atmospheric pressure and room tem- 
perature. Such a box contains about 2.5 x 1019 molecules. [See 
Eq. (27) later in this chapter.] A fluctuation where all of these are 
located in one half of the box would, on the average, appear in only 
one out of 


There are about 3.15 x 107 seconds in a year and the estimated age of the universe is 
of the order of 1019 years, 
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frames of the film. (This number of frames is so fantastically large 
that it could not be accumulated even though our film ran for times 
incredibly larger than the age of the universe.) Fluctuations where 
not all, but a predominant majority of the molecules are found in one 
half of the box, would occur somewhat more frequently; but this fre- 
quency of occurrence would still be exceedingly small. Hence we 
arrive at the following general conclusion: If the total number of par- 
ticles is large, fluctuations corresponding to an appreciably nonuniform 
distribution of these molecules occur almost never. 

Let us now conclude by summarizing our discussion of the isolated 
ideal gas which has been left undisturbed for a long time. The num- 
ber n of molecules in one half of the box fluctuates in time about the 
constant value 4N which occurs most frequently. The frequency of 
occurrence of a particular value of n decreases rapidly the more n 
differs from 4N, i.e., the greater the difference |An| where 


An=n-4N. 6) 


Indeed, if N is large, only values of n with |An| <N occur with sig- 
nificant frequency. Positive and negative values of An occur equally 
often. The time dependence of n has thus the appearance indicated 
schematically in Fig, 1.7. 

The gas can be described in greatest detail by specifying its micro- 
scopic state, or microstate, at any time, i.e., by specifying the maximum 
possible information about the gas molecules at this time (e.g., the 
position and velocity of each molecule). From this microscopic point 
of view, a hypothetical film of the gas appears very complex since the 
locations of the individual molecules are different in every frame of 
the film. As the individual molecules move about, the microscopic 
state of the gas changes thus in a most complicated way. Froma large- 
scale or macroscopic point of view, however, one is not interested in 
the behavior of each and every molecule, but in a much less detailed 
description of the gas. Thus the macroscopic state, or macrostate, of 
the gas at any time might be quite adequately described by specifying 
merely the number of molecules located in any part of the box at this 
time.{ From this macroscopic point of view, the isolated gas which 


# To be specific, we could imagine that the box is subdivided into many equal cells, each 
having a volume large enough to contain many molecules ordinarily. ‘The macroscopic 
state of the gas could then be described by specifying the number of molecules located in 
each such cell. 


Fig. 1.7 Schematic illustration showing how 
the number n of molecules in one half of a box 


fluctuates as a function of the time t. 
total number of molecules is N. 


The 
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has been left undisturbed for a long time represents a very simple 
situation since its macroscopic state does not tend to change in time. 
Indeed, suppose that, starting at some time f;, we observed the gas over 
some moderately long period of time τ by taking a movie of it. Alter- 
natively, suppose that, starting at some other time fz, we again observed 
the gas over the same time 7 by taking a movie of it. From a macro- 
scopic point of view these two movies would ordinarily look indistin- 

ishable. In each case the number n of particles in the left half of 
the box would ordinarily fluctuate about the same value 4N, and the 
magnitude of the observed fluctuations would ordinarily also look alike. 
Disregarding very exceptional occurrences (to be discussed in the next 
section), the observed macrostate of the gas is thus independent of the 
starting time of our observations; 1.6., we can say that the macrostate 
of our gas does not tend to change in time. In particular, the value 
about which n fluctuates (or, more precisely, its average value) does not 
tend to change in time. A system of many particles (such as our gas) 
whose macroscopic state does not tend to change in time is said to be 
in equilibrium. 


Remark 


To define the concept of a time average 
in precise terms, let us denote by n(t) the 
number of molecules in the left half of 
the box at any time t. The time average 
value of n at any time t, taken over the 
time interval τ, can then be denoted by 
[n(n]. and is defined by 


(nt) = tf nr) dt’. 6) 


Equivalently, if a movie strip beginning at 
the time t extends over a time 7 and con- 
tains g = τήτο frames occurring at suc- 
cessive times t=, te=t+ τὸ, 
ty = t+ 270,....t) =t + (g — Ito, the 
definition (6) becomes 


[ri], = Elna) + n(te) + +++ + n(t,)]. 


If we omit explicit indication of the time 
interval τ to be considered, πίι) implies 
an average over some appropriately 
chosen time interval τ of appreciable 
length. In the equilibrium situation of 
our gas, π tends to be constant and equal 
τον. 
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1.2 Irreversibility and the Approach to Equilibrium 
Consider an isolated gas consisting of a large number N of molecules. 
If the fluctuations occurring in this gas in equilibrium are such that n 
is ordinarily very close to its most probable value 4N, under what 
conditions can we expect to find situations where n differs very appre- 
ciably from 4N? Such situations can occur in two different ways which 
we shall discuss in turn. 


Rarely occurring large fluctuations in equilibrium 

Although in the gas in equilibrium n is ordinarily always close to ἐν, 
values of n far from 4N can occur, but they do so quite rarely. If we 
observe the gas long enough, we may thus succeed in observing, at 
some particular instant of time t, a value of n appreciably different 
from 4N. 

Suppose that such a large spontaneous fluctuation of |An| has oc- 
curred, e.g., that n at some particular time ty assumes a value ny which 
is much greater than ἐν. What can we then say about the probable 
behavior of n as time goes on? To the extent that [π — 3N) is large, 
the particular value n; corresponds to a highly nonuniform distribution 
of the molecules and occurs very rarely in equilibrium. It is then most 
likely that the value nq occurs as a result of a fluctuation which is rep- 
resented by a peak whose maximum is near my (as indicated by the 
peak marked X in Fig. 1.8). The reason is the following: It might be 
possible for a value as large as nj to occur also as a result of a fluctua- 
tion represented by a peak whose maximum is larger than ny (such as 
the peak marked Y in Fig. 1.8); but the occurrence of such a large 
fluctuation is much less likely still than the already rare occurrence of 
a smaller fluctuation such as X. Thus we may conclude that it is indeed 
most likely that the time ἔχ, where n = nj, corresponds to a peak (such 
as X) where n is maximum. The general behavior of n as a function 
of time is then, however, apparent from Fig. 1.8. As time proceeds, 
n must tend to decrease (with accompanying small fluctuations) until 
it reverts to the usual equilibrium situation where it no longer tends to 
change, but fluctuates merely about the constant average value 4N. 
The approximate time required for the large fluctuation (where n = 3) 
to decay back to the equilibrium situation (where n = HN) is called 
the relaxation time for decay of this fluctuation. Note that, among 
many movie strips of duration 7, a movie strip showing the gas near 
the time t where a large fluctuation occurs is highly exceptional. Not 
only does such a movie strip occur very rarely but, if it does occur, it 


Fig. 1.8 Schematic illustration showing rare 
instances where the number τὶ of molecules in 
one half of a box exhibits large fluctuations 
about its equilibrium value IN. 
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Fig. 1.9 The piston in (a) is moved to the 
position (b) so as to compress the gas into the 
left half of the box. When the piston is sud- 
denly restored to its initial position, as shown 
in (c), the molecules immediately afterward 
are all located in the left half of the box, while 
the right half is empty. 
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is distinguishable from other movie strips since it shows a situation 
which tends to change in time.+ 

Our comments can thus be summarized in the following way: If n 
is known to assume a value n; which is appreciably different from its 
equilibrium average value 4N, then n will almost} always change in 
such a direction as to approach the equilibrium value ὁ In more 
physical terms, the value n; corresponds to a very nonuniform distri- 
bution of the molecules and the molecules would have to move in a 
ve ial way in order to preserve such nonuniformity. The inces- 
sant motion of the molecules thus almost always results in mixing them 
up so thoroughly that they become distributed over the entire box in 
the most random (or uniform) possible way. (See Figs. 1.15 through 
1,20 at the end of this section.) 


Remarks 


Note that the statements of the preceding 
paragraph are equally applicable whether 
the large fluctuation (n, — 4N) is positive 
or negative. If it is positive, the value n, 
will almost always correspond to the max- 
imum of a fluctuation in n (such as that 
indicated by the peak X in Fig. 1.8). If 
it is negative, it will almost always corre- 
spond to the minimum of a fluctuation in 
n. The argument leading to the conclu- 
sion of the paragraph remains, however, 
essentially identical. 

Note also that the statements of the 
paragraph remain equally valid irrespec- 
tive of whether the change in time is in 
the forward or backward direction (i-e., 
irrespective of whether a movie of the gas 
is played forward or backward through a 
Projector). If πὶ corresponds to a maxi- 
mum such as that indicated by the peak 
X at time t;, then n must. decrease both 
for t > t; and for t << ti. 


} This does not contradict the statement that the gas is in equilibrium in the long run, 
i.e, when observed over a very long time which may contain several occurrences of fluc- 
tuations as large as nj. 

{We use the qualifying word “almost” since, instead of corresponding to a maximum 
of a peak such as X, the value n may very rarely lie on the rising side of a peak such as Y. 
In this case n would initially increase, ie., change away from its equilibrium value $N. 
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Specially prepared initial situations 
Although a nonrandom situation, where n is appreciably different from 
4N, may occur as a result of a spontaneous fluctuation of the gas in 
equilibrium, such a large fluctuation occurs so rarely that it would al- 
most never be observed in practice. [Remember the numerical esti- 
mates based on Eq. (2) or (3).] Most macroscopic systems with which 
we deal, however, have not remained isolated and undisturbed for very 
long periods of time and, accordingly, are not in equilibrium. Non- 
random situations thus occur quite commonly, not as a result of spon- 
taneous fluctuations of a system in equilibrium, but as a result of 
interactions which affected the system at some not too distant time 
in the past. Indeed, it is quite easy to bring about a nonrandom situa- 
tion of a system by means of external intervention. 


Examples 


When a wall of 8 box is made movable, it 
becomes a piston. One can use such a 
piston (85 shown in Fig. 1.9) to com- 
press @ gas into the left half of a box. 
When the piston is suddenly restored to 
its initial position, all the molecules im- 
mediately afterward are still in the left 
half of the box. Thus one has produced 
an extremely nonuniform distribution of 
the molecules in the box. 

Equivalently, consider a box divided 
into two equal parts by a partition (see 
Fig. 1.10). Its left half is filled with N 
molecules of ἃ gas, while its right half is 
empty. If the gas is in equilibrium under εἰ. 
these circumstances, the distribution of 
its molecules is essentially uniform 
throughout the left half of the’ box. Im- 
agine that the partition is now suddenly 
removed. The molecules immediately 
afterward are then still uniformly distrib- 
uted throughout the left half of the box. 
This distribution, however, is highly non- 
uniform under the new conditions which 
leave the molecules free to move through- 
out the entire box. 
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Fig. 1.10 When the partition in (a) is sud- 
denly removed, all the molecules immediately 
afterward are located in the left half of the 
box, as shown in (b). 
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Suppose that an isolated system is known to be in a highly nonran- 
dom situation, e.g., suppose that all the molecules in a gas are known 
to be predominantly in the left half of the box so that n is appreci bly 
different from $N. Then it is essentially irrelevant whether the system 
got into this condition by virtue of a very rare spontaneous fluctuation 
in equilibrium, or whether it got there by virtue of some form of prior 
external intervention. Irrespective of its past history, the subsequent 
behavior of the system in time will thus be similar to that discussed 
previously when we considered the decay of a large fluctuation in 
equilibrium. In short, since nearly all possible ways in which the mole- 
cules of the system can move will result in a more random distribution 
of these molecules, the situation of the system will almost always tend 
to change in time so as to become as random as possible. After the 
most random condition has been attained, it exhibits then no further 
tendency to change, i.e., it represents the ultimate equilibrium condi- 
tion attained by the system. For example, Fig. 1.11 indicates sche- 
matically what happens after the partition of Fig. 1.10 is suddenly 
removed. The number n of molecules in the left half tends to change 
from its initial value n = N (corresponding to a highly nonuniform 
distribution of the molecules in the box) until the ultimate equilibrium 
situation is attained where n = 4N (corresponding to an essentially 
uniform distribution of molecules in the box). (See Figs. 1.12 and 1.18.) 

The important conclusion which we have reached in this section 
can thus be summarized as follows: 


If an isolated system is in an appreciably nonrandom sit- 
uation, it will (except for fluctuations which are unlikely 
to be large) change in time so as to approach ultimately 
its most random situation where it is in equilibrium. 


(7) 


Note that the preceding statement does not make any assertions 
about the relaxation time, 1.6., the approximate time required for the 
system to reach its final equilibrium situation. The actual magnitude 
of this time depends sen: 'y on details of the systern under 
consideration; it might be of the order of microseconds or of the order 
of centuries. 


Fig. 1.11 The box of Fig. 1.10 is here shown 
(a) immediately after the partition has been 
removed, (b) a short time thereafter, and 
(c) a long time thereafter. A movie played 
backward would show the pictures in the re- 
verse order (c), (Ὁ), (a). 


+ 


+ 
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Example 


Referring to Fig. 1.10, consider again the 
box divided into two equal parts by 8 par- 
tition. The left half of the box contains 
N molecules of gas. while its right half is 
empty. Imagine now that the partition 
is suddenly removed—but only partly (as 
shown in Fig. 1.13) rather than fully (as 
in the previous experiment of Fig. 1.10). Fig. 1.12 Schematic illustration showing how 
\n both experiments the nonrandom ai the number n of molecules in the left half of 
wUon kumneciately sitar. the μα ον ον g ls the box in Fig, 1.11 varies as a function of the 


removed (when the number n of mole- ‘A Buea i 2 a 
cules in the left half is equal to N’) tends time 1, beginning with the instant immediately 


to change in time until the molecules be- after the partition is removed. The relaxation 
come essentially uniformly distributed time is indicated by τ. 


throughout the entire box (so that 
n=4N). But the time required until 
the final equilibrium condition is reached 
will be longer in the experiment of Fig. 
1.13 than in that of Fig. 1.10. 


Irreversibility (a) 
The statement (7) asserts that, when an isolated macroscopic system 
changes in time, it tends to do so ina very definite direction—namely, 
from ἃ Tess random tp a more random situation, We could observe 
the process of change by taking a movie of the system. Suppose now 
that we played the movie backward through a projector (ie., that we 
ran the film through the projector in a direction opposite to that used 
in the original movie camera photographing the process). We would 
then observe on the screen the time-reversed process, i.e., the process 
that would occur if one imagined the direction of time to be reversed. 
The movie on the screen would look very peculiar indeed since it would 
portray a process in which the system changes from a more raridom 
to a much less random situation—something which one would almost ἰ (Ὁ) 

never observe in actuality. Just by watching the movie on the screen, Ἑ 

we could conclude with almost complete certainty that the movie is Fig. 1.13. The partition in (a) is suddenly re- 
being played backward through the projector. moved partway only, as shown in (b). 


THE SATURDAY EVENING POST 


Fig, 1.14 This cartoon is humorous because 
it portrays the reverse of an irreversible proc 
ess. The indicated sequence of events could 
happen, but it is exceedingly unlikely that it 
ever would. (Reprinted by special permission 
of The Saturday Evening Post and James 
Frankfort, © 1965 The Curtis Publishing Com- 


pany.) 
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Example 


For example, suppose one filmed the 
process that takes place after the parti- 
tion in Fig. 1.10 is suddenly removed. 
The movie played forward through the 
projector would show the gas spreading 
out (as indicated in Fig. 1.11) until it be- 
comes essentially uniformly distributed 
throughout the entire box. Such a proc: 
ess is quite familiar. On the other hand, 
the movie played backward would show 
the gas, initially distributed uniformly 
throughout the box, concentrating itself 
spontaneously in the left half of the box 
so as to leave the right half empty. Such 


a process is virtually never observed in 
actuality. This does not mean that this 
process is impossible, only that it is ex- 
ceedingly improbable. What the back- 
ward movie shows could happen in actu- 
ality if all the molecules moved in an 
extremely special way.t. But itis exceed- 
ingly improbable that all the molecules 
would ever move in this special way; in- 
deed, it is as improbable as the occur- 
rence of a fluctuation where n = Ν (ina 
situation where the gas is in equilibrium 
throughout the entire box). 


A process is said to be irreversible if the time-reversed process (the 
one which would be observed in a movie played backward) is such 
that it would almost never occur in actuality. But all macroscopic 
systems not in equilibrium tend to approach equilibrium, i.e., situa- 


tions of greatest randomness. 


Hence we see that all such systems 


exhibit irreversible behavior. Since in everyday life we are constantly 
surrounded by systems which are not in equilibrium, it becomes clear 
why time seems to have an unambiguous direction which allows us to 
distinguish clearly the past from the future. Thus we expect people 
to be born, grow up, and die. We never see the time-reversed process 
(in principle possible, but fantastically unlikely) where someone rises 
from his grave, grows progressively younger, and disappears into his 
mother’s womb. 

Note that there is nothing intrinsic in the laws of motion of the par- 
ticles of a system which gives time a preferred direction. Indeed, 
suppose that one took a movie of the isolated gas in equilibrium, as 
shown in Fig. 1.4 (or considered the time dependence of the number 
n of molecules in one half of the box, as indicated in Fig. 1.5). Look- 


ing at this movie projected on a screen, we would have no way of 


} Indeed, consider the molecules at some time th after they have become uniformly dis- 
tributed throughout the box. Now suppose that, at some subsequent time fz, each mole- 
cule would again be in exactly the same position as at the time t; and would have a velocity 
of equal magnitude but precisely opposite direction. Then each molecule would retrace 
its path in time. The gas would thus reconcentrate itself in the left half of the box. 
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telling whether it was running through the projector forward or back- 


+ ward. The preferred direction of time arises only when one deals with 


stem which is somehow known to be in a 


an isolated macroscopic 


very special nonrandom situation at a specified time t;. If the system 
has been left undisturbed for a very long time and got into this situa- 


tion as a result of a very rare spontaneous fluctuation in equilibrium, 
there is indeed nothing special about the direction of time. As already 
pointed out in connection with the peak X in Fig. 1.8, the system then 
tends to change toward the most random situation as time goes either 
forward or backward (i.e., both when the movie is played forward, 
and when it is played backward starting at the time 1). The only 
other way the system can have got into the special nonrandom situa- 
tion at the time ty is by virtue of interaction with some other system 
at some prior time. But in this case a specific direction of time is sin- 
gled out by the knowledge that, before being left undisturbed, the 
system was made to interact with some other system at some time 
before the time ty. 

Finally it is worth pointing out that the irreversibility of spontane- 
ously occurring processes is a matter of degree. The irreversibility 
becomes more pronounced to the extent that the system contains many 
particles since the occurrence of an orderly situation then becomes 
increasingly unlikely compared to the occurrence of a random one. 


Example 


Consider a box which contains only asin- not really tell with appreciable certainty 
gle molecule moving around and colliding — whether the film is being played forward 
elastically with the walls. If one took a or backward. (See Fig. 1.15.) But if 
movie of this system and then watched Ν = 40, this kind of process would very 
its projection on a screen, there would — rarely occur in actuality as a result of a 
never be any way of telling whether the spontaneous fluctuation. (On the aver- 
movie is being played forward or back- age, only 1 out of 2% = 240 Ξ- 1012 


ward through the projector. 

Consider now a box which contains Ν᾽ 
molecules of an ideal gas. Suppose that 
8 movie of this gas is projected on a 
screen and portrays a process in which 
the molecules of the gas, originally dis- 
tributed uniformly throughout the box, 
all become concentrated in the left half 
of the box. What could we conclude? If 
N= 4, this kind of process could, in ac- 
tuality, occur relatively frequently as a 
result of a spontaneous fluctuation. (On 
the average, 1 out of every 16 frames of 
‘the film would show all the molecules in 
the left half of the box.) Hence we can- 


frames of the film would show all the 
molecules in the left half of the box.) It 
is much more likely that the film is being 
played backward and portrays the result 
of a prior intervention, e.g., the removal 
of 8 partition which had previously con- 
fined the molecules in the left half of the 
box. (See Fig. 1.17.) For an ordinary 
gas where N ~ 1020, spontaneous fluc- 
tuations of the kind seen on the screen 
would almost never occur in actuality. 
One could then be almost completely cer- 
tain that the film is being played back- 
ward. 
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Fig. 1.16 Construction of a possible past history for Fig. 1.16. 
The pictures on this left-hand page were computed by starting 
with all the particles in the left half of the box in the positions 
shown in the frame j = 0 on the right-hand page and reversing 
the directions of all the assumed particle velocities in this frame. 
‘The resulting evolution of the system in time is then shown by 
the sequence of frames read in the order j = 0, —1, —2,..- 
15. The short line segment emanating from each particle in- 
dicates the direction of its velocity in this case. 


If the velocity of every particle on this left page is now imag- 
ined to be reversed in direction, then the sequence of frames in 
the order j = —15, —14,..., —1, 0, 1,..., 14 (extending 
over both left and right pages) represents a possible motion of 
the particles in time. This motion, starting from the very special 
initial situation prevailing in the frame j = —15 by virtue of the 
way it was constructed, then results in a fluctuation where all 
the particles are found in the left half of the box in the frame 
Ξ 9. 
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Fig. 1.16 Computer-made pictures showing 4 particles in a box. 
The pictures were constructed by starting with the Special situa- 
tion where all the particles are in the left half of the box in the 
positions shown in the frame j = 0 and are given some arbitrary 
assumed velocities. The resulting evolution of the system in 


time is then shown by the sequence of frames} = 0,1,2,..., 14. 
‘The number of particles located in each half of the box is printed 
directly beneath that half. The short line segment emanat- 
ing from each particle indicates the direction of the particle’s 


velocity. 
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Fig, 1.17 Construction of a possible past history for Fig. 1.18. 
The pictures on this left-hand page were computed by starting 
with all the particles in the left half of the box in the positions 
shown in the frame 7 = Ὁ on the right-hand page and reversing 
the directions of all the assumed particle velocities in this frame. 
The resulting evolution of the system in time is then shown by the 
sequence of frames read in the order j = 0, —1, —2,..., —15. 
No velocities are indicated. 


If the velocity of every particle on this left page is now imag- 
ined to be reversed in direction, then the sequence of frames in 
the order j = —15, —14,..., —1,0,1,..., 14 (extending over 
both left and right pages) represents a possible motion of the par- 
ticles in time. This motion, starting from the very special initial 
situation prevailing in the frame j = —15 by virtue of the way 
it was constructed, then results in a fluctuation where all the par- 
ticles are found in the left half of the box in the frame j = 0. 
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Fig. 1.18 Computer-made pictures showing 40 particles in a 
box. The pictures were constructed by starting with the special 
situation where all the particles are in the left half of the box in 
the positions shown in the frame j = 0 and are given some arbi- 


trary assumed velocities. The resulting evolution of the system in 
time is then shown by the sequence of frames j7=0,1,2,...,14. 
The number of particles located in each half of the box is printed 
directly beneath that half. No velocities are indicated, 
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Fig. 1.19 The number n of particles in the 
left half of the box as a function of the frame 
index j or the time ὁ = ἐτρ. The number n in 
the jth frame is indicated by a horizontal line 
extending from j to j - 1. The graphs de- 
scribe Figs. 1.15 and 1.16 for N = 4 particles, 
and Figs. 1.17 and 1.18 for N = 40 particles, 
but contain information about more frames 
than were shown there. The right half of each 
graph shows the approach of the system to 
equilibrium. The entire domain of each graph 
shows the occurrence of a rare fluctuation 
which might occur in an equilibrium situation. 


ἢ. τ γα τις, 
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Fig. 1.20 The relative number n/N of parti- 
cles in the left half of the box as a function of 
the frame index j or the time t = το: The 
information presented is otherwise the same as 
that in Fig, 1.19. 
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Fig. 1.21 A simple system of particles, each 
| having spin ὁ. Each spin can point either up 
or down. 
» 
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1.3. Further Illustrations 

By thinking in detail about the simple case of the ideal gas of N mole- 
cules, we have grappled with all the essential problems involved in 
understanding systems consisting of very many particles. Indeed, 
most of the remainder of this book will consist merely of the systematic 
elaboration and refinement of the ideas which we have already dis- 
cussed. To begin, let us illustrate the universal applicability of the 
basic concepts which we have introduced by considering briefly a few 
further examples of simple macroscopic systems. 


Ideal system of N spins 

Consider a system of N particles each of which has a “spin }” and an 
associated magnetic moment of magnitude po. The particles might 
be electrons, atoms having one unpaired electron, or nuclei such as 
protons. The concept of spin must be described in terms of quantum 
ideas. The statement that a particle has spin } thus implies that a 
measurement of the component (along a specified direction) of the spin 
angular momentum of the particle can have only two discrete possible 
outcomes: The measured component can be either +4 or — (where 
h denotes Planck’s constant divided by 27), i.e., the spin can be said 
to point either parallel or antiparallel to the specified direction. Cor- 
respondingly, the component (along the specified direction) of the 
magnetic moment of the particle can be either +p or --μο, i.e., the 
magnetic moment can also be said to point either parallel or antipar- 
allel to the specified direction. For the sake of simplicity, we shall 
designate these two possible orientations as “up” or “down,” respec- 
tively. 

The system of N particles with spin 4 is thus quite similar to a col- 
lection of N bar magnets, each having a magnetic moment μὸ which 
can point either up or down. For the sake of simplicity, we may re- 
gard the particles as essentially fixed in position, as they would be if 
they were atoms located at the lattice sites of a solid.{ We shall call 
the system of spins ideal if the interaction between the spins is almost 
negligible. (This is the case if the average distance between the par- 
ticles with spin is so large that the magnetic field produced by one 
moment at the position of another moment is small enough to be nearly 
negligible.) 


+ The magnetic moment of a particle may be antiparallel to its spin angular momentum. 
(This is the usual situation if the particle is negatively charged.) In this case the magnetic 
moment points down when the spin points up, and vice versa. 

=» {If the particles are free to move in space, their translational motion can usually be 
treated separately from the orientation of their spins. 


(2) 
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The ideal system of N spins has been described entirely in terms of 
quantum mechanics, but is otherwise completely analogous to the ideal 
gas of N molecules. In the case of the gas, each molecule moves about 
and collides occasionally with other molecules; hence it is found some- 
times in the left and sometimes in the right half of the box. In the 
case of the system of spins, each magnetic moment interacts slightly 
with the other magnetic moments, so that its orientation occasionally 
changes; hence each magnetic moment is found to point sometimes up 
and sometimes down. In the case of the isolated ideal gas in equilib- 
rium, each molecule is equally likely to be found in either the left or 
right half of the box. Similarly, in the case of the isolated system of 
spins in equilibrium in the absence of any externally applied magnetic 
field, each magnetic moment is equally likely to be found pointing 
either up or down. We can denote by n the number of spins pointing 
up, and by n’ the number of spins pointing down. In equilibrium the 
most random situation where n =n’ ~4N thus occurs most fre- 
quently, while fluctuations where n differs appreciably from $N occur 
very rarely. Indeed, when N is large, nonrandom situations where 
n differs appreciably from 4N occur almost always as a result of prior 
interaction of the isolated system of spins with some other system. 


Distribution of energy in an ideal gas 

Consider again the isolated ideal gas of N molecules. We reached the 
general conclusion that the time-independent equilibrium situation 
attained by the system after a sufficiently long time corresponds to the 
most random distribution of the molecules. In our previous discussion 
we focused attention solely on the positions of the molecules. We 
then saw that the equilibrium of the gas corresponds to the most ran- 
dom distribution of the molecules in space, i.e., to the essentially uni- 
form distribution of the molecules throughout the fixed volume of the 
box. But what can we say about the velocities of the molecules? Here 
it is useful to recall the fundamental mechanical principle that the total 
energy E of the gas must remain constant since the gas is an isolated 
system. This total energy E is equal to the sum of the energies of the 
individual gas molecules, since the potential energy of interaction be- 
tween molecules is negligible. Hence the basic question becomes: 
How is the fixed total energy of the gas distributed over the individual 
molecules? (If a molecule is monatomic, its energy € is, of course, 
merely its kinetic energy ε = 4mv?, where m is its mass and v its 
velocity.) It is possible that one group of molecules might have very 
high energies while another group might have very low energies. But 


(a) 


᾿ (b) 


(e) 


Fig. 1.238 Oscillation of a pendulum immersed 
ina gas. The pendulum is shown successively 
(a) soon after it is set into oscillation, (b) a short 
time thereafter, and (c) after a very long time. 
A movie played backward would show the pic- 
tures in the reverse order (c), (b), (a). 
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this kind of situation is quite special and would not persist as the mole- 
cules collide with each other and thus exchange energy. The time- 
independent equilibrium situation which is ultimately attained corre- 
sponds therefore to the most random distribution of the total energy 
of the gas over all the molecules. Each molecule has then, on the 
average, the same energy and thus also the same speed.{ In addition, 
since there is no preferred direction in space, the most random sit- 
uation of the gas is that where the velocity of each molecule is equally 
likely to point in any direction. 


Pendulum oscillating in a gas 

Consider a pendulum which is set oscillating in a box containing an 
ideal gas. If the gas were not present, the pendulum would continue 
oscillating indefinitely with no change in amplitude. (We neglect fric- 
tional effects that might arise at the point of support of the pendulum.) 
But in the presence of the gas, the situation is quite different. The 
molecules of the gas collide constantly with the pendulum bob. In 
each such collision energy is transferred from the pendulum bob to a 
molecule, or vice versa. What is the net effect of these collisions? 
Again this question can be answered by our familiar general argument 
without the need to consider the collisions in detail.{_ The energy ἔν 
(kinetic plus potential) of the pendulum bob plus the total energy E, 
of all the gas molecules must remain constant since the total system 
is isolated (if one includes the earth which provides the gravitational 
attraction). If the energy of the bob were transferred to the gas mole- 
cules, it could be distributed over these many molecules in many dif- 
ferent ways instead of remaining entirely associated with the bob. A 
much more random situation of the system would thus result. Since 
an isolated system tends to approach its most random situation, the 
pendulum gradually transfers practically all its energy to the gas mole- 
cules and thus oscillates with ever-decreasing amplitude. This is again 
a typical irreversible process. After the final equilibrium situation has 
been reached, the pendulum hangs vertically, except for very small 
oscillations about this position. 


} This does not mean that each molecule has the same energy at any one time. The 
energy of any one molecule fluctuates quite appreciably in the course of time as a result 
of its collisions with other molecules. But when each molecule is observed over a suffi- 
ciently long time interval τ, its average energy over that time interval is the same as that 
of any other molecule. 

} A detailed analysis would argue that the pendulum bob suffers more collisions per unit 
time with the molecules located on the side toward which the bob moves than with the 
molecules located on the other side. As a result, collisions in which the bob loses energy 
to a molecule are more frequent than collisions in which it gains energy from a molecule. 


ων) 
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Note one further point of interest. In the initial nonrandom situa- 
tion, where the pendulum bob has a large amount of energy associated 
with it, this energy can be exploited to do useful work on a macro- 
scopic scale. For example, the pendulum bob could be made to hit 
a nail so as to drive it some distance into a Piece of wood. After the 
final equilibrium has been reached, the energy of the pendulum bob 
has not been lost; it has merely become redistributed over the many 
molecules of the gas. But there is now no easy way to use this energy 
to do the work necessary to drive the nail into the wood. Indeed, this 


would necessitate some method for concentrating energy, randomly ~ 


distributed over the many gas molecules moving in many directions, 
50 that it can exert a net force in only one particular direction over an 
appreciable distance. 


1.4. Properties of the Equilibrium Situation 
Simplicity of the equilibrium situation 


The discussion of the preceding sections shows that the equilibrium 
situation of a macroscopic system is particularly simple. The reasons 
are the following: 


(i) The macrostate of a system in equilibrium is time-independent, 
except for ever-present fluctuations, Quite generally, the macrostate 
of a system can be described by certain macroscopic parameters, i.e., 
parameters which characterize the properties of the system on a large 
scale. (For example, the number n of molecules located in the left half 
of a box of gas is such a macroscopic parameter.) When the system 
is in equilibrium, the average values of all its macroscopic parameters 
remain constant in time, although the parameters themselves may ex- 
hibit fluctuations (ordinarily quite small) about their average values. 
The equilibrium situation of a system is, therefore, simpler to treat 
than the more general nonequilibrium case where some macroscopic 
parameters of the system tend to change in time. 


(i) The macrostate of a system in equilibrium is, except for fluctua- 
tions, the most random macrostate of the system under the specified 
conditions. The system in equilibrium is thus characterized in a 


unigue way. In particular, this has the following implications: 


L Nail 


Fig. 1.23 Arrangement in which the pendu- 
lum bob can be made to strike a nail and thus 
do work on it by driving it intoa piece of wood, 
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ὃς (a) The equilibrium macrostate of a system is independent of its 
past history. For example, consider an isolated gas of N’ molecules in 


a box. These molecules may originally have been confined by a par- 
tition to one half of the box-or to one quarter of the box (the total 
energy of the molecules being assumed the same in each case). Rut 
after the partition is removed and equilibrium has been attained, the 
macrostate of the gas is the same in both cases; it corresponds merely 
to the uniform distribution of all the molecules through the entire box. 


(b) The equilibrium macrostate of a system can be completely speci- 
fied by very few macroscopic parameters. For example, ‘consider 
again the isolated gas of N identical molecules in a box. Suppose that 
the volume of the box is V, while the constant total energy of all the 
molecules is E. If the gas is in equilibrium and thus known to be in 
its most random situation, then the molecules must be uniformly dis- 
tributed throughout the volume V and must, on the average, share 
equally the total energy E available to them. A knowledge of the 
macroscopic parameters V and E is, therefore, sufficient to conclude 
that the average number ji, of molecules in any subvolume V, of the 
box is i, = N(V,/V), and that the average energy € per molecule is 
@ = E/N. If the gas were not in equilibrium, the situation would of 
course be much more complicated. The distribution of molecules 
would ordinarily be highly nonuniform and a mere knowledge of the 
total number N of molecules in the box would thus be completely in- 
sufficient for determining the average number i, of molecules in any 
given subvolume V, of the box. 


Observability of fluctuations 

Consider 8 macroscopic parameter describing a system consisting of 
many particles. If the number of particles in the system is large, the 
relative magnitude of the fluctuations exhibited by the parameter is 
ordinarily very small. Indeed, it is often so small as to be utterly neg- 
ligible compared to the average value of the parameter. As a result, 
we remain usually unaware of the existence of fluctuations when we 
are dealing with large macroscopic systems. On the other hand, the 
ever-present fluctuations may be readily observed and may become 
of great practical importance if the macroscopic system under con- 
sideration is fairly small or if our methods of observation are quite sen- 
sitive. Several examples will serve to illustrate these comments. 


Gy 


See. 1.4 


Density fluctuations in a gas 


Consider an ideal gas which is in equilib- 
rium and consists of a large number N of 
molecules confined within a box of vol- 
ume V. Focus attention on the number 
n, of molecules located within some spec- 
ified subvolume V, inside the box. This 
number n, fluctuates in time about an 
average value 


the magnitude of its fluctuation at any 
time being given by the difference 


An, =n -- jx 


lf we consider the left half of the box as 
the part of interest, V, = ἐν δπαῆ, = 4N. 
When V, is large, the average number a, 
of molecules is also large. In accordance 
with our discussion of Sec. 1.1, the only 
fluctuations which occur with appreciable 
frequency are then those sufficiently 
small so that |An,| <ii,. 

On the other hand, suppose that we 
wanted to investigate the scattering of 
light by a substance. Then we would be 
interested in knowing what happens in a 


Fluctuations of a torsion pendulum 


Consider a thin fiber stretched between 
two supports (or suspended from one 
support under the influence of gravity) 
and carrying an attached mirror. When 
the mirror turns through a small angle, 
the twisted fiber provides a restoring 
torque. The mirror is thus capable of 
performing small angular oscillations and 
constitutes accordingly a torsion pendu- 
lum. Since the restoring torque of a thin 
fiber can be made very small and since 
8 beam of light reflected from the mirror 
provides a very effective way of detecting 
smail angular deflections of the mirror, 
@ torsion fiber is commonly used for 
very sensitive measurements of small 
torques. For example, it may be recalied 
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volume element V, having linear dimen- 
sions of the order of the wavelength of 
light. [Since the wavelength of visible 
light (about 5 x 10-® cm) is much larger 
than atomic dimensions, such a volume 
element is still macroscopic in size, al- 
though it is small.] 
molecules in every such volume element 
were the same (as is very nearly the case 
in a solid such as glass), then the sub- 
stance under consideration would be spa- 
tially uniform and would merely refract 
a beam of light without scattering it, But 
in the case of our ideal gas, the average 
number ji, of molecules in a volume as 
small as V, is quite small and fluctuations 
An, in the number n, of molecules within 
V, are no longer negligible compared to 
jy. The gas can, therefore, be expected 
to scatter light to an appreciable extent. 
Indeed, the fact that the sky does not 
look black is due to the fact that light 
from the sun is scattered by the gas mole- 
cules of the atmosphere. The blue color 
of the sky thus provides visible evidence 
for the importance of fluctuations, 


that a torsion pendulum was used by Cav- 
endish to measure the universal constant 
of gravitation and by Coulomb to measure 
the electrostatic force between charged 
bodies. 

When a sensitive torsion pendulum is 
in equilibrium, its mirror is not perfectly 
Still, but can be seen to perform erratic 
angular oscillations about its average 
equilibrium orientation. (The situation 
is analogous to that of the ordinary pen- 
dulum, discussed in Sec. 1.3, which ex- 
hibits in equilibrium small fluctuations 
about its vertical position.) These fluc: 4. 
tuations are ordinarily caused by the 
random impacts of the surrounding air 
molecules on the mirror. 


If the number of .- 


Fig, 1.24 Torsion pendulum formed by a mir- 
ror mounted ona thin fiber. A beam of light 
reflected from the mirror mdicates the angle 
of rotation φ of the mirror. 
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Fig, 1.26 A resistor R connected to the input 
terminals of a sensitive amplifier whose output 
is displayed on an oscilloscope. 
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{The fluctuations of the mirror would 
be modified, but would not disappear, 
even if all the molecules of the surround- 
ing gas were removed. In that case the 
total energy of the torsion pendulum 
would still consist of two parts, the en- 
ergy. E, due to the angular velocity of the 
mirror moving as 8 whole, plus the en- 
ergy E, due to the internal motion οἵ "δὴ! 
the atoms of the mirror and fiber. (The 
atoms are free to perform small vibra- 


Brownian motion of a particle oy 


Small solid particles, about 10-* cm in 
size, can be introduced into a drop of 
liquid and then observed under a micro- 
scope. Such particles are not at rest, 
but are seen to be constantly moving 
about in a highly irregular way. This 
phenomenon is called Brownian motion 
because it was first observed in the last 
century by an English botanist named 
Brown. He did not understand the origin 
of the phenomenon. It remained for Ein- 
stein in 1905 to give the correct explana- 
tion in terms of random fluctuations to 


tions about their sites in the solids which 
constitute the mirror “and fiber.) Al- 
though. the total enerey Ἐν + Εἰ of the 
torsion pendulum is constant, fluctua- 
tions do occur in the way this energy is 
shared between E.and £,. Any fluctua 
tion in which E, gains energy at the ex- 
pense of the internal motion of the atoms 
thus results in an increased angular ve- 
locity of the mirror, and vice versa.] 


be expected in equilibrium. A solid par- 
ticle is subject to a fluctuating net force 
due to the many random collisions of the 
particle with the molecules of the liquid. 
Since the particle is small, the number 
of molecules with which it collides per 
unit time is relatively small and accord: 
ingly fluctuates appreciably. In addition, 
the mass of the particle is so smail that 
any collision has a noticeable effect on 
the particle. The resulting random mo- 
tion of the particle thus becomes large 
enough to be observable. 


Fig, 1.35 Brownian motion of a solid particle, 10-4 cm in 
diameter. suspended in water and observed through a micro- 
scope. The three-dimensional motion of such a particle, as seen 
projected in the horizontal plane of the field of view of the 
microscope, is shown by this diagram where the lines join con- 
secutive positions of the particle as observed at 30-second inter- 
vals, [The data are from J. Perrin, Atoms, p. 115 (D. Van 
Nostrand Company, Inc., Princeton, N.J., 1916} 


Voltage fluctuations across a resistor 


Η an electrical resistor is connected 
across the input terminals of a sensitive 
electronic amplifier, the output of the 
amplifier is observed to exhibit random 
voltage fluctuations. Neglecting noise 
originating in the amplifier itself, the 
reason is the random Brownian mo- 
tion of the electrons in the resistor. Sup- 
pose, for example, that this random 


ὃν) 


motion leads to a fluctuation where the 
number of electrons is greater in one 
half of the resistor than inthe other. The 
resulting charge difference then leads to = 
an electric field in the resistor, and hence 
to a potential difference across its ends. 
Variations in this potential difference + 
thus give rise to the voltage fluctuations 
amplified by the electronic instrument. 
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The existence of fluctuations can have important practical conse- 
quences. This is particularly true whenever one is interested in 
measuring small effects or signals, since these may be obscured by the 
intrinsic fluctuations ever-present in the measuring instrument. (These 


fluctuations are then said to constitute noise since their presence makes 
measurements difficult.) For example, it is difficult to use a torsion 
fiber to measure a torque which is so small that the angular deflection 
produced by it is less than the magnitude of the intrinsic fluctuations 
in angular position displayed by the mirror. Similarly, in the case of 
the resistor connected to the amplifier, it is difficult to measure an 
applied voltage across this resistor if this voltage is smaller than the 
magnitude of the intrinsic voltage fluctuations always present across 
the resistor. + 5 


1.5 Heat and Temperature 


Macroscopic systems which are not isolated can interact and thus 
exchange energy. One obvious way in which this can happen is if one 
system does macroscopically recognizable work upon some other sys- 
tem. For example, in Fig. 1.28 the compressed spring A’ exerts a net 
force on the piston confining the gas A. Similarly, in Fig. 1.29, the 
compressed gas A’ exerts a net force on the piston confining the gas A. 
When the piston is released and moves through some macroscopic 
distance, the force exerted by A’ performs some work{ on the system A. 

It is, however, quite possible that two macroscopic systems can in- 
teract under circumstances where no macroscopic work is done. This 
type of interaction, which we shall call thermal interaction, occurs 
because energy can be transferred from one system to the other sys- 
tem on an atomic scale. The energy thus transferred is called heat. 
Imagine that the piston in Fig. 1.29 is clamped in position so that it can- 
not move. In this case one system cannot do macroscopic work upon 
the other, irrespective of the net force exerted on the piston. On the 
other hand, the atoms of system A do interact (or collide) with each 
other almost constantly and thus exchange energy among themselves.§ 


4 Typically, voltages of the order of one microvolt or less may become difficult to meas- 
ure. By averaging the measurement over a sufficiently long time, one may, however, dis- 
criminate against the random fluctuations in favor of the applied signal which does not 
fluctuate in time. 

} The term’ work is used here in its usual sense familiar from mechanics and is thus 
defined basically as a force multiplied by the distance through which it acts. 

ΤῈ each molecule of the gas consists of more than one atom, different molecules can 
exchange energy by colliding with each other; furthermore, the energy of a single molecule 
can also be exchanged between its constituent atoms as a result of interaction between 
them. 


Fig. 1.27 Actual photograph of the noise out- 
put voltage displayed on an oscilloscope in the 
experimental arrangement of Fig, 1.26. (Pho- 
tograph by courtesy of Dr. F. W. Wright, Jr, 
University of California, Berkeley.) 


Fig. 1.28 The compressed spring A’ does 
work on the gas A when the piston moves 
through some net macroscopic distance. 
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Fig. 1.29 The compressed gas A’ does work 
on the gas A when the piston moves through 
some net macroscopic distance. 
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Fig, 1.30 Schematic diagram showing two 
general systems A and A’ in thermal contact 
with each other. 
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Similarly, the atoms of system A’ exchange energy among themselves. 
At the piston, which forms the boundary between the gases A and A’, 
the atoms of A interact with the atoms of the piston, which then inter- 
act among themselves and then in turn interact with the atoms of A’. 
Hence energy can be transferred from A to A’ (or from A’ to A) as a 
result of many successive interactions between the atoms of these 
systems. 

Consider then any two systems A and A’ in thermal interaction with 
each other. For example, the two systems might be the two gases A 
and A’ just discussed; or A might be a copper block immersed in a 
system A’ consisting of a container filled with water. Let us denote 
by E the energy of system A (i.e., the total energy, kinetic plus poten- 
tial, of all the atoms in A); similarly let us denote by E’ the energy of 
system A’, Since the combined system A* =A + A’ consisting of 
both A and A’ is supposed to be insulated, the total energy 


E + E’ = constant.t (8) 


The question arises, however, how this total energy is actually dis- 
tributed between the systems A and A’. In particular, suppose that 
the systems A and A’ are in equilibrium with each other, i.e., that the 
combined system A* is in equilibrium. Except for small fluctuations, 
this equilibrium situation of A* must then correspond to the most ran- 
dom distribution of energy in this system. 

Let us first discuss the simple situation where the systems A and A’ 
are two ideal gases consisting of molecules of the same kind. [For 
example, both A and A’ might consist of nitrogen (N2) molecules.] In 
this case the most random situation of the combined system A* is 
clearly that where its total energy E + E’ is shared indiscriminately 
among all the identical molecules of A*. Each molecule of A and A’ 
should then have the same average energy. In particular, the average 
energy € of a molecule of gas A should be the same as the average 
energy ε΄ of a molecule of gas A’, ie., 


in equilibrium, Se, (9) 


Of course, if there are N molecules in gas A and N’ molecules in gas A’, 
then 


ron ee oe =. (10) 


δι 
i 
ΕΣ 


}In the case of the system of Fig. 1.29, we assume for the sake of simplicity that the 
container walls and the piston are so thin that their energies are negligibly small compared 
to the energies of the gases. 
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Hence the condition (9) can also be written in the form 


Es ek 
Now 
Suppose that the gases A and A’ are initially separated from each 
other and separately in equilibrium. We denote their energies under 
these circumstances by Εἰ and E;’, respectively. Imagine now that the 
systems A and A’ are placed in contact with each other so that they 
are free to exchange energy by thermal interaction. There are two 
cases which may arise: 
(i) Ordinarily, the initial energies Εἰ and Εἰ of the systems are such 
that the average initial energy ει = E;/N of a molecule of A is not the 
same as the average initial energy € = E;'/N’ of a molecule of A’; i.e., 


ordinarily, εἰ τέ εἰ. (1) 


In this case the initial distribution of energy in the combined system 
A* is quite nonrandom and will not persist in time. Instead, the sys- 
tems A anid A’ will exchange energy until they ultimately attain the 
equilibrium situation corresponding to the most random distribution 
of energy, that where the average energy per molecule is the same in 
both systems. The energies Ey and Ej’ of the systems A and A’ in the 
final equilibrium situation must then be such that 


Ε,. Εἰ 


x= 5b (12) 


& =e or 


In the interaction process leading to the final equilibrium situation 
the system with the smaller average initial energy per molecule thus 
gains energy, while the system with the larger average initial energy 
per molecule loses energy. Of course, the total energy of the isolated 
combined system A* remains constant so that 

E/ + E; = Ej + Ej. 
Thus AE + AE’ = 0, (13) 
or Q+Q0=0, (14) 


where we have written 


(15) 


and 


The quantity Q is called the heat absorbed by A in the interaction 
process and is defined as the increase in energy of A resulting from 
the thermal interaction process, A similar definition holds for the heat 
Θ΄ absorbed by A’. 


Ι ἘΠ ΞΕ; 
Ϊ E,+E,=E, +E; 
i Q=E,-E,=E,-E; 


f (b) 
f 


Fig. 1.31 Two gases A and A’, consisting of 
molecules of the same kind, are initially sepa- 
rated from each other in (a). They are then 
brought into thermal contact with each other 
in (b) and are allowed to exchange heat until 
they reach equilibrium. The energy of a gas 
is denoted by E, the average energy per mole- 
cule by é. 
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Note that the heat Q = AE absorbed by a system may be either 
positive or negative. Indeed, in the thermal interaction between two 
systems, one loses energy while the other gains energy; i.e., in (14) 
either ( is positive and Q’ is negative, or vice versa. By definition, the 
system which gains energy by absorbing a positive amount of heat is 
said to be the colder system; on the other hand, the system which loses 
energy by absorbing a negative amount of heat (i.e., by “giving off” a 
positive amount of heat) is said to be the warmer or hotter system. 

(ii) A special case may arise where the initial energies Εἰ and E;’ of 
the systems A and A’ are such that the average energy of a molecule 
of A happens to be the same as that of a molecule of A’; 1.6., it may 
happen that 


a= εὐ. (16) 


In this case, when the systems A and A’ are brought into thermal con- 
tact with each other, they automatically give rise to the most random 
situation of the combined system A*. Thus the systems remain in 
equilibrium and there occurs no net transfer of energy (or heat) be- 
tween the systems. 


Temperature 

Consider now the general case of thermal interaction between two 
systems A and A’, These systems might be different gases whose mole- 
cules have different masses or consist of different kinds of atoms; one 
or both of the systems might also be liquids or solids. Although the 
conservation of energy (8) is still valid, it now becomes more difficult 
to characterize the equilibrium situation corresponding to the most 
random distribution of the energy over all the atoms of the combined 
system A + A’. Qualitatively, most of our preceding comments con- 
cerning the case of two similar gases ought, however, still to be appli- 
cable. What we should then expect (and shall explicitly show later on) 
is the following: Each system, such as A, is characterized by a parame- 
ter T (conventionally called its “absolute temperature”) which is 
related to the average energy per atom in the system. In the equi- 
librium situation corresponding to the most random distribution of 
energy we then expect, analogously to (9), a condition of the form 


T=T. (17) 


It is not possible to define the concept of absolute temperature more 
precisely until we have given a more precise definition of the concept 
of randomness (applicable to the distribution of energy over unlike 
atoms). It is, however, easy to introduce the concept of a temperature 
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(not the “absolute temperature”) measured with a particular thermom- 
eter. By a thermometer we mean any small macroscopic system M 
arranged so that only one of its macroscopic parameters changes when 
the system M absorbs or gives off heat. This parameter is called the 
thermometric parameter of the thermometer and will be denoted by 
the Greek letter 6. For example, the familiar mercury or alcoho! ther- 
mometer is a special example of a thermometer. Here the length L 
of the column of liquid in the glass capillary tube of the thermometer 
changes when the energy of the liquid changes as a result of a transfer 
of heat. The thermometric parameter @ of this kind of thermometer 
is then the length L. If the thermometer M is placed in thermal con- 
tact with some other system A and allowed to come to equilibrium, 
its thermometric parameter 6 assumes some value 64. This value 64 
is called the temperature of the system A with respect to the particular 
thermometer M.t 

The usefulness of a thermometer is made apparent by the following 
considerations. Suppose that the thermometer M is placed in thermal 
contact first with a system A and then with a system B. In each case 
the thermometer is allowed to come to equilibrium; it then indicates 
the respective temperatures 64 and 6g. Two cases may arise: either 
64 Op or 04 = Oy. Itisa familiar experimental fact (which will later 
be justified theoretically) that if 04 ~ θ5, the systems A and B will 
exchange heat when brought into thermal contact with each other; 
but, if #4 = θ8, the systems will not exchange heat on thermal contact. 
The temperature @ of a system measured with respect to the particular 
thermometer is thus a very useful parameter characterizing the system, 
since a knowledge of the temperature allows the following predictive 
statement: If two systems are brought into thermal contact with each 
other, they will exchange heat if their temperatures are unequal and 
will not exchange heat if their temperatures are equal. 


1.6 Typical Magnitudes 

The preceding sections have shown us in broad outline how the behav- 
ior of macroscopic systems can be understood in terms of their con- 
stituent molecules or atoms. Our considerations have, however, been 
quite qualitative. To complete our preliminary orientation, we should 
also like to acquire some perspective about representative orders of 


i The system A is supposed to be much larger than the thermometer M so that it is dis- 
turbed negligibly by the small amount of energy gained or lost to the thermometer. Note 
also that, according to our definition, the temperature measured by a mercury thermometer 
is a Jength and is thus to be measured in units of centimeters. 


Fig. 1.32 Two different kinds of thermome- 
ters in thermal contact with a system consist- 
ing of a container filled with a liquid. One 
thermometer is a mercury thermometer; its 
thermometric parameter is the length L of the 
mercury column in the glass capillary tube. 
The other thermometer is an electrical resistor 
R made, for example, either of platinum wire 
or carbon; its thermometric parameter is its 
electrical resistance R (determined by passing 
asmall current 1 through the resistor and meas- 
uring the voltage V across it). 
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F sree sie Manometer 
Fig. 1.33 A gas whose average pressure pis 
measured by means of a manometer consisting 
of a U-tube filled with mercury. In order to 
be in mechanical equilibrium, the mercury 
levels must adjust themselves so that the mer- 
cury column of height h has a weight, per unit 
cross-sectional area, equal to the pressure ex- 
erted by the gas. 


es 


Fig. 1.34. Diagram illustrating collisions of gas 
molecules with an area A of a container wall. 
(The z axis points out of the paper.) 
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magnitude. For example, it would be worth knowing how fast a typi- 
cal molecule moves or how frequently it collides with other molecules. 
To answer such questions, we may again turn to the simple case of an 
ideal gas. 


Pressure of an ideal gas 

When a gas is confined within a container, the many collisions of the 
gas molecules with the walls of the container give rise to a net force 
on each element of area of the walls, The force per unit area is called 
the pressure p of the gas. The average pressure p exerted by the gas 
is readily measured, for example by means of a manometer. The gas 
pressure should be calculable in terms of molecular quantities; con- 
versely, it should then be possible to use the measured gas pressure 
to deduce the magnitudes of molecular quantities. We shall begin, 
therefore, by giving a simple approximate « calculation of the pressure 
exerted by an ideal gas. mates Ὁ 

To be specific, we shall consider an ideal gas of N molecules, each 
having a mass m. We suppose that the gas is in equilibrium and that 
it is confined within a box in the shape of a rectangular parallelepiped 
having a volume V. The number of molecules per unit volume is con- 
veniently denoted} by n= N/V. The edges of the box can be as- 
sumed to be parallel to a set of cartesian x, y, 5 axes, as shown in 
Fig, 1.34. 

‘We focus attention on a wall of the box, e.g,, the right wall which 
is perpendicular to the x axis. Let us first ask how many molecules 
collide with an area A of this wall during some short time interval ἡ. 
‘All the molecules do not have the same velocity at any one time. But 
since we are satisfied with approximate results, we can simplify our 
considerations by assuming that each molecule moves with the same 
speed, equal to its average speed δ. The molecules move, however, 
in random directions so that, on the average, one-third of them (or 4n 
molecules per unit volume) move predominantly along the x axis, one- 
third along the y axis, and one-third along the z axis. Of the ἐπ mole- 
cules per unit volume moving predominantly along the x axis, one-half 
(or $n molecules per unit volume) move in the +x direction toward 
the area A, while the remaining half move in the —x direction away 


med) - 


from the area A. Any molecule having a velocity directed predomi- ++ 


nantly in the +x direction travels during the short time { a distance 
Btin the +x direction. If such a molecule is located within a distance 


} The fact that the symbol n was previously used in ἃ different context to denote the 
number of molecules in one half of a box of gas should not lead to confusion. 
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Εἰ from the area A of the wall, it will thus strike this area in the time t; 
but if it lies further than a distance δὲ from the area A, it will not reach 
this area and thus will not collide with it.} Hence the average num- 
ber of molecules which collide with the area A during the time ὁ is 
simply equal to the average number of molecules which have a velocity 
predominantly in the +x direction and which are contained within 
the cylinder of area A and length tt. It is, therefore, obtained by mul- 
tiplying 4n (the average number of molecules per unit volume which 
have their velocity predominantly in the +x direction) by the volume 
Ait of the cylinder and is thus given by 


(gn)(Aot). 


If we divide this result by the area A and the time t, we obtain an 
approximate expression for 7.0, the average number of molecules strik- 
ing unit area of a wall per unit time (or the molecular flux density). 
Thus 


(8) 


Let us now calculate the average force exerted by the colliding 
molecules on a unit area of the wall. When such a molecule, moving 
predominantly in the +2 direction, strikes the wall, its kinetic energy 
4m? remains unchanged. (This must be true, at least on the average, 
since the gas is in equilibrium.) The magnitude of the momentum of 
the molecule must then, on the average, also remain unchanged; i.e., 
the molecule, approaching the right wall with momentum m@ in the 
+x direction, must have a momentum —mi in this direction after it 
rebounds from the wall. The +x component of momentum of the 
molecule changes then by an amount —mé — πιῦ = —2mi asa result 
of the collision with the wall. Correspondingly it follows, by the prin- 
ciple of conservation of momentum, that the wall gains in the collision 
an amount of momentum +2mié in the +x direction. But the average 
force exerted on the wall by the gas molecules is, by Newton’s second 
law, equal to the average rate of change of momentum experienced 
by the wall as a result of the molecular collisions. The average force 
exerted on unit area of the wall (i.e., the average pressure j on the 
wall) is then simply obtained by multiplication, i.e., 


i Since the time interval t can be considered arbitrarily short, much shorter than the 
average time between collisions of the molecules among themselves, collisions of the given 
molecule with other molecules are very unlikely to occur during the time { and can be 
disregarded. 
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the average momentum the average number of colli- 

p = |2mé gained by the wall 1. |sions experienced per unit tine) . 
in one molecular collision by a unit area of the wall 

Thus Pp = (2met)Fo ϑηιτ ) ἐπε) 


or (19) 
As would be expected, the pressure p is seen to be increased (i) if n is 
made large so that there are more molecules colliding with the walls, 
and (ii) if Ὁ is increased so that the molecules collide with the walls 
more frequently and give up more momentum per collision. 

Since the average kinetic energy € of a molecule is approximately 
given byt 


€®) = dn 


(20) 
the relation (19) can also be written as 
p= ane, (21) 


Note that (19) and (21) depend only on the number of molecules per 
unit volume, but make no reference to the nature of these molecules. 
These relations are thus equally valid, irrespective of whether the gas 
consists of molecules of He, Ne, Oz, Nz, or CHy. The average pres- 
sure of any ideal gas kept in a container of fixed volume thus provides 
a very direct measure of the average kinetic energy of a molecule of 
the gas. 


Numerical estimates 
Before making some numerical estimates, it will be useful to recall 
some important definitions. The mass m of an atom or molecule is 
conveniently expressed in terms of some standard mass unit mo. In 
accordance with present international convention (adopted in 1960 
and called the unified scale of atomic weights) this mass unit mo is 
defined in terms of the mass mc of an atom of the particular carbon 
isotope 130 by} 
__ me 
mo ==: 22) 
jae 03 (22) 

1 We neglect here the distinction between @, the average of a square, and δ, the square 
of an average. 

} We recall that a particular isotope is defined as an atom X with a uniquely specified 
nucleus; the symbol *X indicates that the atom has n nucleons (protons + neutrons) in its 
nucleus. Atoms having nuclei with different numbers of neutrons but the same number 
of protons are chemically alike since they have the same number of extranuclear electrons. 
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The mass of a °C atom is thus exactly equal to 12 mass units. The 
mass of a H atom is then approximately equal to one mass unit. 

The ratio of the mass m of an atom (or molecule) to the mass unit 
mo is called the atomic weight of the atom (or molecular weight of the 
molecule) and will be denoted by ». Thus 

= Ξο βὰς. έχνας. (23) 
The atomic weight of 12C is thus, by definition, equal to 12. 

A convenient macroscopic number of atoms (or molecules) is the 
number N, of atoms of mass mo which would have a total mass of 
1 gram: i.e., the number N, is defined by 


(24) 
Alternatively, this defining relation can be written in the form 
1 Cadicinn) mem nao wn) no 
Νι τς at (25) 
No ae JAG ν᾿ mmo m 


where we have used (23). This means that N, is also equal to the num- 
ber of molecules, of molecular weight μι which have a total mass of 
μι grams. The number Ν᾽ is called Avogadro's number. 

One mole of a certain kind of molecule (or atom) is defined as a 
quantity consisting of N, molecules (or atoms) of this kind. A mole of 
molecules of molecular weight has, therefore, a total mass of μ grams. 

The numerical value of Avogadro’s number is experimentally found 
to he 


Νὰ = (6.02252 + 0.00009) x 1058 molecules/mole. (26) 


(See the table of numerical constants at the end of the book.) 


Let us now use the relations (19) or total number N of molecules in the ex- 
(21) for the pressure of a gas to estimate perimental vessel is thus 
molecular quantities for nitrogen (Nz) 
gas, the main constituent of air. Atroom 
temperature and atmospheric pressure 
(105 dynes/cm?), the mass of Nz gas con- 
tained within a vessel having a volume of το that 
one liter (10° cm’) is experimentally 
found to be about 1.15 grams. Since the - Ν _ 247 ΧΊΟΣ 
atomic weight of a N atom is about 14, 105 
the molecular weight of @ No molecule = 2.5 X 1019 molecules/em3. (27) 
is 2x 14 = 28. It follows that 28 gm 
of No gas contains Avogadro's number Using (21), it then follows that the aver- 
Nz = 6.02 x 1053 of Ny molecules. The 886 kinetic energy of 8 No molecule is 
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Fig. 1.35 Diagram illustrating an encounter 
between two hard spheres of radius a. 


Fig. 1.36 Diagram illustrating the collisions 
suffered by a particular molecule A when it 
encounters another molecule A’ whose center 
is located within the volume swept out by the 
area σ of the imaginary disk carried by A. 
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m= 


4.65 χ 10-3 gm. 


rane (29) 


Ξε X10 erg, 428) Hence (20) implies that 


Since N, nitrogen molecules Lwhere N, is 
Avogadro's number) have « mass of 28 
gm, the massm of a single Ns molecule is ΟἹ 


Mean free path 

Focusing attention on a molecule in a gas at any instant of time, let 
us estimate the average distance | which this molecule travels before 
it collides with some other molecule in the gas. This distance / is 
called the mean free path of the molecule. To simplify matters, we 
can imagine that each molecule is spherical in shape and that the forces 
between any two molecules are similar to those between two hard 
spheres of radius a. This means that the molecules exert no force on 
each other as long as the separation R between their centers is greater 
than 2a, but that they exert extremely large forces on each other (i.¢., 
they collide) if R< 2a. Figure 1.35 illustrates an encounter between 
two such molecules. Here molecule A’ can be considered stationary 
as molecule A approaches it with some relative velocity V in such a 
way that the centers of the molecules would approach within a dis- 
tance b of each other if they remained undeflected. It is then appar- 
ent that the molecules will never collide if b > 2a, but that they will 
collide if b << 2a. Another way of expressing this geometrical rela- 
tionship is to imagine that molecule A carries with it a disk of radius 
2a (this disk being centered about the center of the molecule and ori- 
ented perpendicular to the velocity V). A collision between the two 
molecules will then occur only if the center of molecule A’ lies within 
the volume swept out by the disk carried by molecule A. 

The area o of the imaginary disk carried by a molecule is 


σ = 72a)? = 4a? (31) 
and is called the total scattering cross section for molecule-molecule 
collisions. The volume swept out by this disk as the molecule travels 


a distance ἰ is equal to ol. Suppose that this volume is such that it 
contains, on the average, one other molecule, i.e., that 


(ol )n = 1, 


where n is the number of molecules per unit volume. Then the dis- 
tance 1 is the average distance traveled by the molecule before suffer- 


ioe 
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ing a collision with another molecule, ice., it is the desired mean free 
path. Thus we get 


(32) 


As one would expect, the mean free path becomes long (i) if n is small, 
so that there are only few molecules with which a given molecule can 
collide, and (ii) if the molecular radius is small, so that molecules must 
approach each other quite closely before they collide. 


To estimate orders of magnitude, we ἘΞ Σ Ξ - -4΄ 
return to the previously discussed case no — (2.5 x 1019\(12 x 1071) 
of Nz gas at room temperature and at- 4, =e 89) 
mospheric pressure. The radius of 8 
molecule is of the order οἱ 10-8 cm, i.e., Note that Ia, i.e., the mean free path 

is much larger than the radius of a mole- 

a~ 10 em. cule. The molecules thus interact with 

each other infrequently enough so that 

it is a good approximation to treat the 

πα ~ 12 X 10-16 em? gas as ideal. On the other hand, the 

mean free path is very small compared 

Using the value (27) of n, (32) then yields ἰὸ the linear dimensions of the one-liter 
the estimate vessel containing the gas. 


Hence (31) gives for the cross section 


1.17 Important Problems of Macroscopic Physics 

Although the discussion of this chapter has been mostly qualitative, it 
has revealed the most significant features of macroscopic systems. We 
have, therefore, gained sufficient perspective to recognize some of the 
questions which we should ultimately like to explore and understand. 


Fundamental concepis 
Our first task must clearly be that of transforming our qualitative in- 
sights into precisely formulated theoretical concepts capable of making 


. guantitative predictions. For example. we have recognized that cer- 


tain situations of ἃ macroscopic system are more probable (or more 
random) than others. But precisely how does one assign a probability 
to a given macrostate of a system and how. does one measure its degree 
of randomness? This question is of all-pervasive importance. We also 
concluded that 
terized as being the most random situation of an isolated system. The 
problem of defining randommess in a precise general way thus occurs 
again. Indeed, this problem proved troublesome to us when we tried 
to discuss the case of two arbitrary systems in thermal contact with 


@ 


@+ 
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each other. We suspected that the equilibrium condition of maximum 
randomness should imply that some parameter T (measuring roughly 
the mean energy per atom in a system) ought to be the same for both 
systems. But since we did not know how to define randomness in the 
general case, we were unable to arrive at an unambiguous definition 
of this parameter T (which we called the “absolute temperature”) 
Thus we can pose the following basic question: How can we use prob- 
ability ideas to describe macroscopic systems in a systematic way so 
as to define concepts such as randomness or absolute temperature? 

In discussing the example of the pendulum in Sec. 1.3, we saw that 
it is not readily apparent how to transform energy randomly distrib- 
uted over many molecules to a less random form where it can do work 
by exerting a net macroscopic force over a macroscopic distance. This 
example illustrates questions of the most profound importance. In- 


. deed, to what extent is it possible to take energy distributed randomly 


over many molecules of a substance (such as coal or gasoline) and to 
transform it into a less random form where it can be used to move 
pistons against opposing forces? In other words, to what extent is it 
possible to build the steam engines or gasoline engines responsible for 
our industrial revolution? Similarly, to what extent is it possible to 
take the energy distributed randomly over many molecules of certain 
chemical compounds and to transform it into a less random form where 
it can be used to produce muscular contraction or the synthesis of 
highly ordered polymer molecules such as proteins? In other words, 
to what extent can chemical energy be used to make biological proc- 
esses possible? A good understanding of the concept of randomness 
should allow us to make significant statements about all these questions. 


Properties of systems in equilibrium 
Since macroscopic systems in equilibrium are particularly simple, their 
properties ought to be most readily amenable to quantitative discus- 
sion. There are, indeed, many equilibrium situations which are of 
great interest and importance. Let us mention some of the questions 
which are worthy of investigation. aie 

A homogeneous substance is one of the simplest systems whose 
equilibrium properties one may hope to calculate. For example. sup- 
pose that some particular fluid (gas or liquid) is in equilibrium at a 
given temperature. How does the magnitude of the pressure which 
it exerts depend on its temperature and on its volume? Or suppose 
that some substance contains a certain concentration of iron atoms, 
each of which has a definite magnetic moment. If this substance is 
at a given temperature and is located in a given magnetic field, what 


Ww) 
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is the value of its magnetization, ie., of its net magnetic moment per 
unit volume? How does this magnetization depend on the tempera- 
ture and on the magnetic field? Or suppose that a small amount of 
heat is added to some particular substance (which might be a liquid, 
a solid, or a gas). By what amount does its temperature increase? 

Not only can we ask questions about macroscopic parameters of a 
system in equilibrium; we can also inquire into the behavior of the 
atoms of which it consists. For example, consider a container of gas 
maintained at some given temperature. All the molecules of the gas 
do not have the same speed and we can ask what fraction of the mole- 
cules have a speed in any specified range. If we make a very small 
hole in the container, some of the molecules will escape through the 
hole into the surrounding vacuum where their speed can be directly 
measured and the theory compared with the experimental results, Or 
consider an empty container whose walls are maintained at some ele- 
vated temperature. Since the atoms in the walls emit electromagnetic 
radiation, the container itself is then filled with radiation (or with pho- 
tons) in equilibrium with the walls. What amount of energy of this 
electromagnetic radiation is concentrated in any given frequency 
range? If one makes a very small hole in the container through which 
some of the radiation escapes, then one can readily use a spectrometer 
to measure the amount of radiation emerging in any narrow frequency 
band and thus can compare the predictions with experimental results, 
This last problem is actually very important for understanding the 
radiation emitted by any hot body, whether it be the sun or the fila- 
ment of a light bulb. 

Another type of situation of great interest is that where chemical 
reactions can occur between various kinds of molecules. As a specific 
example, consider a container of volume V which is filled with carbon 
dioxide (COz) gas. It is possible to transform CO, molecules into 
carbon monoxide (CO) and oxygen (09) molecules, or vice versa, ac- 
cording to the chemical reaction 


2CO2 =2:2'CO 410s; (34) 


As the temperature of the container is raised, some of the COy mole- 
cules will dissociate into CO and Οἱ molecules. The container will 
then enclose a mixture of COz, CO, and Oy gases in equilibrium with 
each other. One would like to know how to calculate, from first prin- 
ciples, the relative number of COs, CO, and O» molecules thus present 
in equilibrium at any specified temperature. 

But even a simple substance consisting only of a single kind of mole- 
cule presents some intriguing problems. Such a substance can typi- 
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Fig, 1.37 A container filled with gas has a 
very small hole through which some molecules 
can escape into a surrounding vacuum, Col- 
limation of these emerging molecules by one 
or more slits yields a well-defined molecular 
beam. The distribution of velocities of the 
molecules in the beam is closely related to that 
of the molecules in the container. Molecular 
beams of this sort represent a very powerful 
method for studying nearly isolated atoms or 
molecules and have been used to perform some 
of the most fundamental experiments of mod- 
ern physics. 


ut 


— Water vapor 


— Liquid water 


Fig. 1.38 Liquid water and its gas form, water 
vapor, are here shown coexisting in equilibrium 
at some specified temperature. The pressure 
exerted by the vapor then has a unique value 
which depends only on the temperature. 
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cally exist in several different forms, or phases, such as gas, liquid, or 
solid. A specific example might be water which can exist in the form 
of water vapor, liquid water, or ice. Each such phase consists of the 
same kind of molecule (H2O, in the case of water). but the molecules 
are arranged differently. In the gas phase, the molecules are far apart 
from each other and thus move about almost independently of each 
other in random fashion. In the solid phase, on the other hand, the 
molecules are arranged in a very orderly manner. They are located 
in a regular crystal lattice near specific sites and are free merely to 
perform small oscillations about these sites. In the liquid phase the 
situation is intermediate, being neither as orderly as in the solid nor 
as random as in the gas. The molecules here are close to each other 
and always under each other’s strong influence, yet they are free to 
move past each other over long distances. The evidence for these 
molecular arrangements in the various phases comes largely from 
studies of x-ray scattering. 

It is well known that a substance changes from one phase to another 
phase at a very well-defined temperature (absorbing or giving off heat 
in the process). For example, water changes from the solid ice form 
to the liquid water form at 0° centigrade and changes in turn (at a 
pressure of 1 atmosphere) from the liquid water form to the gaseous 
steam form at 100° centigrade. Under certain circumstances two 
phases can, therefore, coexist in equilibrium with each other (e.g., ice 
and liquid water at 0° centigrade).. A good theory of systems in equi- 
librium should allow us to make statements about the conditions of 
temperature and pressure under which two phases are thus able to 
coexist in equilibrium. It should also allow us to predict at what tem- 
perature a particular solid substance melts so as to become a liquid, 
and at what temperature the liquid vaporizes to become a gas. 

These are actually very difficult and fascinating problems. The 
concept of degree of randomness, or order, is again crucial. As the 
absolute temperature (or mean energy per atom) of a substance is in- 
creased, it first changes from the most orderly (or least random) solid 
form to the liquid form which is of an intermediate degree of order; 
as the temperature is increased further, it changes from this liquid form 
to the gas form which is the most nearly random or disordered. But 
it is very striking that these changes from one degree of order to the 
next take place very abruptly at at extremely well-defined temperatures. 
The essential reason is a kind of critical instability involving[all] the 
molecules of the substance. For example, suppose that the μι τ ΒΒ 
temperature of a solid is sufficiently high so that its molecules, as a 
result of their relatively large mean energy, can oscillate about their 


a 
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regular lattice positions with displacements large enough to be com- 
parable with their intermolecular separation. Suppose then that there 
occurs a fluctuation in which a few adjacent molecules move simul- 
taneously out of their regular lattice positions; this makes it easier for 
neighboring molecules slightly farther away also to move out of their 
regular positions; and so forth. The net result is then somewhat analo- 
gous to the collapse of a pile of dominoes; i-e., the high degree of order 
of the solid suddenly begins to disintegrate all at once and the solid 
turns into a liquid. This instability, which results in the melting of 
the solid, involves jointly all the molecules of the solid; it is, therefore, 
said to be a cooperative phenomenon. Since it does require the analy- 
sis of all the molecules in simultaneous interaction, the theoretical 
problem of treating any cooperative phenomenon such as melting or 
vaporization from a detailed microscopic point of view is very difficult 
and challenging. 


Systems which are not in equilibrium 
The discussion of systems which are not in equilibrium is ordinarily 
much more difficult than that of systems which are in equilibrium. 
Here it is necessary to deal with processes which change in time and 
the interest is focused on how fast or slowly they change. The discus- 
sion of such questions requires a detailed analysis of how effectively 
molecules interact with each other. Except in relatively simple cases 
such as dilute gases, this kind of analysis can become fairly complicated. 

Let us mention again a few typical problems of interest. Consider, 
for example, the chemical reaction (34). Suppose that some COs gas 
is introduced into the container at some given elevated temperature. 
How long would it take to achieve the equilibrium concentration of 
CO? Thus one would like to calculate the rate at which the chemical 
reaction (34) proceeds from left to right. τος 

As another illustration, consider two large bodies which are con- 
nected by a rod and which are at different temperatures ΤΊ and Tp. 
Since this is not an equilibrium situation, heat will flow from one body 
to the other through the connecting rod. But how effectively is the 
energy transported down the rod. i.e., how long does it take for a given 


Y= Qik) arnount of heat to flow from one body to the other? This depends on 


an intrinsic property of the rod, its “thermal conductivity.” For ex- 
ample, a rod of copper allows heat flow to proceed much more readily 
than a rod made of stainless steel: i.e.. copper has a much higher ther- 
mal conductivity than stainless steel. The task of the theory is to 
define “thermal conductivity” precisely and to calculate this parame- 
ter from first principles. co 


Heat flow 
πον τος 


Fig. 1.39 Two bodies at different tempera- 
tures connected by a rod which conducts heat 
from one to the other. 


50 Characteristic Features of Macroscopic Systems 


Concluding Remarks 


The preceding survey of problems indicates to some extent the range 
of macroscopic natural phenomena which we might like to treat quan- 
titatively on the basis of fundamental microscopic considerations. We 
shall not Jean how to discuss all such problems in the remainder of 
this book. Indeed, some of the questions which we have raised (e.g., 
calculations of phase transformations such as melting or vaporization), 
give rise to problems which are still not adequately solved and which 
are thus active areas of research. On the other hand, we are now well 
prepared to turn the qualitative considerations of this chapter into a 
more systematic quantitative discussion of the properties of macro- 
scopic systems. This discussion will go a long way toward answering 
the most basic questions which we have raised. 


Summary of Definitions 

isolated system A system which does not interact with any other system. 

ideal gas A gas of molecules whose mutual interaction is almost negligible (i.e., the inter- 
action is large enough to ensure that the molecules can exchange energy among them- 
selves, but is otherwise negligible). 

ideal spin system A system of spins whose mutual interaction is almost negligible (i.¢., 
the interaction is large enough to ensure that the spins can exchange energy among 
themselves, but is otherwise negligible). 

microscopic Small, of the order of atomic dimensions or less. 

macroscopic Very large compared to atomic dimensions 

microscopic state (or microstate) State of a system described in microscopic detail by 
the most complete specification, according to the Jaws of mechanics, of all the atoms in 
the system. 

macroscopic state (or macrostate) State of a system described, without attention to 
microscopic details, by specifying only quantities which can be determined by macro- 
scopic measurements. 

macroscopic parameter A parameter which can be determined by large-scale measure- 
ments and which describes the macroscopic state of a system. 

equilibrium A macroscopic state which does not tend to change in time, except for 
random fluctuations. 

relaxation time Approximate time required for a system, starting from a situation far 
removed from equilibrium, to attain equilibrium. 

irreversible process A process such that the time-reversed process (w hich would be 
observed in a movie played backward) would almost never occur in actuality. 

thermal interaction Interaction which does not involve the performance of work on a 
macroscopic scale. 

heat Energy transfer which is not associated with the performance of macroscopic work, 
but occurs on the atomic scale. 

thermometer A small macroscopic system arranged so that only one of its macroscopic 
parameters changes when the system absorbs or gives off heat. 

thermometric parameter The one variable macroscopic parameter of ἃ thermometer. 
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temperature of a system with respect to a given thermometer The value of the ther- 
mometric parameter of the thermometer when the latter is placed in thermal contact 
with the system and allowed to come to equilibrium with it 

mean free path Τῆς average distance which a molecule 
with another molecule. 


Js ina gas before it collides 


Suggestions for Supplementary Reading 


F. J. Dyson, “What is Heat?”, Sci. American 191, 58 (Sept. 1954). 

R. Furth, “The Limits of Measurement,” Sci. American 183, 48, (July 1950). A discussion 
of Brownian motion and other fluctuation phenomena. 

B. J. Alder and T. E. Wainwright, “Molecular Motions,” Sci. American 201, 113 (Oct. 1959). 
This article discusses the application of modem high-speed computers to the study of 
molecular motions in various macioyvopi¢ systems, 


Problems 
1.1 Fluctuations in a spin system 

Consider an ideal system of 5 spins in the absence of an external mag- 
netic field. Suppose that one took a movie of this spin system in equilibrium. 
What fraction of the movie frames would show n spins pointing up? Consider 
all the possibilities n = 0, 1, 2, 3, 4, and 5. 


1.3 Diffusion of a liquid 

Suppose that a drop of a dye (having the same density as water) is intro- 
duced into a glass of water. The whole system is kept at a constant temperature 
and is left mechanically undisturbed. Suppose that one took a movie of the 
process occurring after the drop of dye had been put into the water. What 
would one see on a screen on which the movie is projected? What would one 
see if the movie is run backward through the projector? 15 the process reversible 
orirreversible? Describe the process in terms of the motion of the dye molecules. 


1.3 Microscopic explanation of friction 

A wooden block, which has originally been given a push, is sliding on 
the floor and gradually ¢omes to rest. Is this process reversible or irreversible? 
Describe the process as it would appear on ἃ movie played backward. Discuss 
what happens during this process on the microscopic scale of atoms and mole- 
cules. 


1.4. The approach to thermal equilibrium 

Consider two gases A and A’ in separate containers. Initially the aver- 
age energy of a molecule of gas A is quite different from the average energy of 
a molecule of gas A’. The two containers are then placed in contact with each 
other so that energy in the form of heat can be transferred from gas A to the 
molecules of the container walls and thence to gas A’. Is the ensuing process 
reversible or irreversible? Describe in microscopic detail the process that would 
appear if one filmed the situation and ran this film backward through a projector. 
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1.5 Variation of gas pressure with volume 

A container is divided into two parts by a partition. One of these parts 
has a volume V; and is filled with a dilute 
the partition and wait until the final equilibrium condition is attained where the 
molecules of the gas are uniformly distributed throughout the entire container 
of volume Vj. 


as: the other part is empty. Remove 


(a) Has the total energy of the gas been changed? Use this result to compare 
the average energy per molecule and the average speed of a molecule in the 
equilibrium situations before and after the removal of the partition. 

(b) What is the ratio of the pressure exerted by the gas in the final situation 
to that of the pressure exerted by it in the initial situation? 


1.6 Number of gas molecules incident on an area 

Consider nitrogen (Ng) gas at room temperature and atmospheric pres- 
sure, Using the numerical values given in the text, find the average number of 
No molecules striking a 1-cm? area of the container walls per second, 


1.17 Leak rate 

A 1-liter glass bulb contains No gas at room temperature and atmospheric 
pressure. The glass bulb, which is to be used in conjunction with some other 
experiment, is itself enclosed in a large evacuated chamber. Unfortunately the 
glass bulb has, unbeknown to the experimenter, a small pinhole about 10- cm 
in radius, To assess the importance of this hole, estimate the time required for 
1 percent of the Nz molecules to escape from the bulb into the surrounding 
vacuum. 


1.8. Average time between molecular collisions 

Consider nitrogen gas at room temperature and atmospheric pressure. 
Using the numerical values given in the text, find the average time a Ny molecule 
travels before colliding with another molecule. 


*1.9 Equilibrium between atoms of different masses 

Consider a collision between two different atoms having masses m, and 
ma, Denote the velocities of these atoms before the collision by νὰ and ve, respec- 
tively; denote their velocities after the collision by νγ΄ and vo’, respectively. It is 
of interest to investigate the energy transferred from one atom to the other as a 
result of the collision. 

(a) Introduce the relative velocity V =v, — v2 and the center-of-mass velocity 
c= (myyi + mov2)/(m, + mz). The relative velocity after the collision is then 
V’ = vi' — vo’. Ina collision ¢ remains unchanged by virtue of conservation of 
momentum, while ΓΝ = ΓΝ] by virtue of conservation of energy. Show that the 
energy gain Ae; of atom 1 in a collision is given by 


Ae, = dmy(vy’? — 02) = myma(my + m)1e- (Ν᾽ — V). (i) 


(b) Denote by 6 the angle between V’ and V, by g the angle between the plane 
containing V’ and V and the plane containing c and V, and by Ψ the angle between 
cand V. Show that (i) then becomes 


Ae, = myma(my + ma)-2eV [(cos θ-- 1) cosy + sin Bsinycosq] (ἢ) 
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where cVcos Ψ = ¢+V = (my + ma)“!fmyo;? + (m2 — my)vs ve — mov2")- 

(c) Consider two atoms of this kind in a gas where many collisions occur. On 
the average, the azimuthal angle ¢ is then as often positive as negative so that 
τος Φ = 0; also, since νὰ and vz have random directions, the cosine of the angle 
between them is as often positive as negative so that ¥j> ve = 0. Show that (ii) 


therefore becomes, on the average, 


2mym 


= creer = cos δγζες — &) (iii) 


ae, 

where εἰ = $0? and ἐς = }mpv2. 
In the equilibrium situation in particular the energy of an atom must, on the 
average, remain unchanged so that Ae; = 0. Show that (iii) then implies that 


in equilibrium, a=. (wv) 


Thus the average energies of interacting atoms in equilibrium are equal even if 
the masses of the atoms are different. 


1.10 Comparison of molecular speeds in a gas mixture 
Consider a gas mixture enclosed in some container and consisting of 
monatomic molecules of two different masses m, and me. 

(a) Suppose that this gas mixture is in equilibrium. Use the result of the pre- 
ceding problem to find the approximate ratio of the average speed Ὁ. of a mole- 
cule of mass m; to the average speed ὃ of a molecule of mass me. 

(b) Suppose that the two kinds of molecules are He (helium) and Ar (argon) 
which have atomic weights equal to 4 and 40, respectively. What is the ratio 
of the average speed of a He atom to that of an Ar atom? 


1.11 Pressure of a gas mixture 

Consider an ideal gas which consists of two kinds of atoms. To be 
specific, suppose that there are, per unit volume, nj atoms of mass ΠῈΣ and ng 
atoms of mass mz. The gas is assumed to be in equilibrium so that the average 
energy € per atom is the same for both kinds of atoms, Find an approximate 
relation for the average pressure p exerted by the gas mixture. Express your 
result in terms of €. 


1.18 Mixing of two gases 

Consider a container divided into two equal parts by a partition. One 
of these parts contains 1 mole of helium (He) gas, the other 1 mole of argon (Ar) 
gas. Energy in the form of heat can pass through the partition from one gas to 
the other. After a sufficiently long time, the two gases will, therefore, come to 
equilibrium with each other. The average pressure of the helium gas is then Pi, 
that of the argon gas is pz. 

(a) Compare the pressures p; and p2 of the two gases. 
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Basic Probability Concepis 


The considerations of the preceding chapter have indicated that prob- 
ability arguments are of fundamental significance for the understand- 
ing of macroscopic systems consisting of very many particles. It will, 
therefore, be useful to review the most basic notions of probability 
and to examine how they can be applied to some simple, but impor- 
tant, problems. Indeed, this discussion is likely to be valuable in 
contexts far wider than those of immediate interest to us. For exam- 
ple, probability concepts are indispensable in all games of chance, in 
the insurance business (with its need to assess the probable occurrence 
of death or disease among its clienis), and in sampling procedures such 
as public opinion polls. In biology, they are profoundly important in 
the study of genetics. In physics, they are needed in treating the 
occurrence of radioactive decay, the arrival of cosmic rays at the sur- 
face of the earth, or the random emission of electrons from the hot 
filament of a vacuum tube; furthermore, they play a fundamental role 
in the quantura-mechanical description of atoms and molecules. Most 
important to us, they will form the basis of our entire discussion of 
macroscopic systems. 


2.1 Statistical Ensembles 

Consider a system A on which one can perform experiments or obser- 
vations.j In many cases the particular outcome resulting from the 
performance of a single experiment cannot be predicted with certainty, 
either because this is intrinsically Impossiblef or because the informa- 
tion available about the system is insufficient to permit such a unique 
prediction. Although it is not possible to make statements about a 
single experiment, it may still be possible to make significant state- 


ments about the outcomes of a large number of similar experiments. 
We are thus led to a statistical description of the system, ie., to a 


+ An act of observation can be regarded as an experiment in which the result of the 
observation constitutes the outcome ofthe experiment. Therefore we need make no dis- 
tinction between experiments and observations. 


} This is the situation, for example, in quantum mechanics where the outcome of a 
meastrement on ἃ microscopic system is ordinarily not predictable with certainty. 


"Ὁ 
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description in terms of probabilities. To achieve such a description, 
we proceed in the folowing way: 

Instead of focusing attention on the single system A of interest, we 
contemplate an assembly (or an ensemble, in more customary termi- 
nology) consisting of some very Jarge number 1 of “similar” systems. 
tn principle, 4 is imagined to be arbitrarily Jarge (i.e., W — 00). The 
systems are supposed to be “similar” in the sense that each system sat- 
isfies the same conditions known to be satisfied by the system A. 
This means that each system is imagined to have been prepared in 
the same way as A and to be subjected to the same experiment as A. 
We can then ask in what fraction of cases a particular outcome of 
ihe experiment does occur. To be precise, we arrange matters so that 
we can e rate in some convenient way all the possible mutually 
eaclusive outcomes of the experiment. (The total number of such 
possible outcomes may be finite or infinite.) Suppose then that a par- 
ticular outcome of the experiment is labeled by r and that there are, 
among the 4” systems of the ensemble, 7’, systems which exhibit this 


outcome. Then the fraction 
P= a (where A” — oo) (1) 


is called the probability of occurrence of the outcome r. To the extent 
that A” is made very large, a repetition of the same experiment on the 
ensemble is expected to lead with increasing reproducibility to the 
same ratio W/V. The definition (I) becomes thus unambiguous in 
the limit where ” is made arbitrarily large. 

The preceding discussion shows how the probability of occurrence 
of any possible outcome of an experiment on‘a system can be measured 
by repeating the experiment on a large number / of similar systems.t 
Although the outcome of the experiment on a single system cannot 
be predicted, the task of a statistical theory is then that of predicting 
the probability of occurrence of each of the possible outcomes of the 


experiment. The predicted probabilities can then be compared against 
ries actnally measured by experiments on an ensemble of 


al enansple wiil serve to illustrate this statistical description in 
a number of concrete cases. 


mation being contemplated is independent of time, one could equally well 
oriment 7 times in succession with: the one particular system under 
to start the experiment each time with the system in the same 
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Fig. 2.1 In onder to make probability state- 
ments about the tossing of a single coin, one 
considers an ensemble consisting of a large 
number Ff of similar coins. 


cates scher 
semble after each coin has been tossed. The 


symbol H denotes “head,” the symbol T “tail.” 


The diagrain indi- 
matically the appearance of this en- 
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Tossing of coins or dice 


Consider the experiment of tossing a 
coin. There are only two possible out- 
comes of such an experiment, either 
“head” or “tail,” depending on whether 
the coin comes to rest ona table with the 
engraved head uppermost or its other 
side uppermost.t, In principle, the out- 
come of the experiment could be com- 
pletely predicted if we knew exactly how 
the coin is tossed and the forces by which 
it interacts with the table. It would then 
only be necessary to make the requisite 
complicated calculations based on the 
laws of classical mechanics. But such 
detailed information about the tossing of 
the coin is not available in practice. It 
is, therefore, not possible to make a 
unique prediction about the outcome of 
a particular toss. (Indeed, even if all the 
requisite information could be obtained 
and all the requisite computations could 
be performed, we would ordinarily not be 
interested in making precise predictions 
at the expense of such enormous com: 
plexity.) The statistical formulation of 
the experiment i 


t_is, however, very easy 
and familiar. We need only consider an 
ensemble consisting of a very large num- 
ber & of similar coins. When these 
coins are tossed in similar ways, we can 
count the fraction of cases where the out- 
come is a head and that where it is a 
tail.¢ These fractions then give, respec- 
tively, the measured probability p of ob- 
taining a head and probability q of obtain- 
ing a tail. A statistical theory would 
attempt to predict these probabilities. 
For example, if the center of mass of the 
coin coincides with its geometrical cen- 
ter, the theory might be based on the 


symmetry argument that there is nothing 
in the laws of mechanics which distin- 
guishes between a head or a tail, [ἢ this 
case, half the experimental outcomes 
should be heads and half tails, so that 
p=4q=4. Comparison with experiment 
might or might not verify the theory. For 
example, if heads were observed to oc- 
cur more frequently than tails, we might 
conclude that the theory is not justified 
in assuming that the center of mass of 
the coin coincides with its geometrical 
center. 

Consider now the somewhat more 
complicated experiment of tossing a set 
of Ncoins. Since the tossing of any one 
coin can have 2 possible outcomes, the 
tossing of the set of N coins can then 
have any of 2x 2x +++ X 2 = 2% pos- 
sible outcomes.§ The statistical formu- 
lation of the experiment requires again 
that, instead of dealing with a single set 
of N coins, we contemplate an ensemble 
consisting of «γ΄ such sets of N coins, 
where each set is tossed in 8 similar way. 
A possible question of interest is then 
the probability that any particular one of 
the 2" possible outcomes occurs in the 
ensemble. A less detailed question of 
interest might concern the probability of 
finding in the ensemble an outcome 
where any n of the coins show heads and 
the remaining (N — n) coins show tails. 

The problem of tossing a set of N dice 
is, of course, analogous. The only dif- 
ference is that the toss of any one die 
can have 6 possible outcomes, depend- 
ing on which of the six faces of the cubi- 
cal die lands uppermost. 


It is often convenient to use the simple word event to denote the 
outcome of an experiment or the result of an observation. Note that 
the probability of “Occurrence of an event depends crucially on the 
information available about the system under consideration. Indeed, 


4 We neglect the remote possibility that the coin comes to rest standing on its edge. 
{ Alternatively, we might take the same coin and toss it A times in succession, counting 


the fraction of cases showing heads or tails. 


§ In the special case where N = 4, the 34 = 16 possible outcomes are listed explicitly 
in Table 1.1 (p. 7) if one interprets the letter L as a head and the letter R as a tail. 
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this information determines the kind of statistical ensemble to be con- 
templated, since this ensemble must consist solely of systems which 
satisfy all the conditions satisfied by the particular system under 


consideration. 


Example 


Suppose that we are interested in the 
following question: What is the probabil- 
ity that a person living in the United 
States will be hospitalized at some time 
between the ages of 23 and 24 years? 
Then we must consider an ensemble 
consisting of a large number of persons 
in the United States and must ascertain 
the fraction of these persons hospitalized 
at some time between the ages of 23 and 


told that the person is of the female sex. 
Then the answer to our question changes 
because we must now contemplate an 
ensemble of women living in the United 
States and must ascertain how many of 
these women are hospitalized at some 
time between the ages of 23 and 24. 
(Indeed, women of this age tend to enter 
hospitals because of childbirth, a condi- 
tion from which men are exempt.) 


24. But suppose that we are now also 


Application to systems of many particles 

Consider a macroscopic system consisting of many particles. For 
example, the system might be an ideal gas of N molecules, a system 
of N spins, a liquid, or a piece of copper. In none of these cases is it 
possible to make a unique prediction about the behavior of each of 
the particles in the system, nor is it of interest. We resort, therefore, 
to a statistical description of the system A under consideration. In- 
stead of considering the single system A, we contemplate an ensemble 
consisting of a large number 1 of systems similar to A. To make a 
statistical statement about the system at the time t, we make observa- 
tions on the M systems at this time t. Thus we can determine the 
probability P,(t) that the observation yields a particular outcome r at 
the time ¢. The procedure is most readily visualized by imagining 
that we take a motion picture of each system in the ensemble. We 
then end up with / film strips which contain the results of all obser- 
vations on the systems of the ensemble. The behavior of any one 
system in the ensemble, say system number k, as a function of time 
can then be obtained by looking at the kth film strip (i.e., by looking 
along a particular horizontal line in Fig. 2.3). On the other hand, 


} In the correct quantum-mechanical description of the system nonstatistical predictions 
are not possible, even in principle. In a classical description a unique prediction about a 
system would require knowledge of the position and velocity of each particle at one time, 
information which is not available to us. 
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Fig, 2.2 Statistical description of a system A 
consisting of a gasina box. The diagram illus- 
trates schematically a statistical ensemble of 
% such systems similar to the system A under 
consideration. 
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Fig, 2.3 Computermade pictures showing a statistical ensem- 
ble of systems. ‘Tisls ensemble was constructed to represent a 
system consisting οἵ 40 particles in a box when the information 
= available about the system is the following: All particles are 
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known to be in the left half of the box at some initial time cor- 
responding to the frame j = 0, but nothing else is known about 


their positions or velocities. 
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Fig, 2.3 (cont.) 

The evolution in time of the kth system in the ensemble can _ system at any time corresponding to the jth frame can be made 
be followed by looking horizontally at the successive frames ΒΥ looking vertically at all the systems in their jth frame and 
j= 0,1, 2,... for this systema. Statistical statements about the _ doing the counting necessary to determine probabilities. 

61 


Fig. 3.4 Continuation of Fig. 2.3. The ensemble has by now 
become time-independent, i.e., the system has attained equi- 
librium. 
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probability statements about/the system | st any one time ¢ are obtained 
by looking at all the film frames taken at the particular time t (i.e., by 
looking along a particular vertical line in Fig. 2.3 and by counting the 
fraction of systems exhibiting a given outcome at this time). 

A statistical ensemble of systems is said to be time-independent if 
the number of its systems exhibiting any particular event is the same 
at every time (or equivalently, if the probability of occurrence of any 
particular event in this ensemble is independent of time). The sta- 
tistical description then provides a very clear definition of equilibrium: 
An isolated macroscopic system is said to be in equilibrium if a statis- 


tical ensemble of such systems is time-independent. 


Example 


Consider an ideal gas of N molecules. 
At some initial time to, immediately after 
a partition has been removed, all the 
molecutes of this gas are known to be in 
the left half of a box. How do we go 
about giving a statistical description of 
what happens at ali subsequent times? 
We need only consider an ensemble con- 
sisting of some large number of simi- 
lar boxes of gas, each one of which has 
all its molecules concentrated in its left 
half at the time to. Such an ensemble is 
shown schematically in Fig. 2.3. We can 
then consider this ensemble at any time 
t > to and ask various questions of inter- 
est. For example, focusing attention on 
any one molecule, what is the probability 
p(t) that this molecule is in the left half 
of the container or the probability q(t) 
that it is in the right half of the con- 


2 tainer? Or what is the probability P(n,t) 


that, at any time t, n of the N mole- 
cules are located in the left half of the 
container? At the initial time to, we 
know that p(to)=1 while φζιο) = 0. 


[Similarly, PQV,to.) = 1, while P(n,to) = 0 
for n=£N.] As time goes on, all these 
probabilities change until the molecules 
become uniformly distributed through- 
out the box so that p= q=4. There- 
after the probabilities remain unchanged 
in time, i.e., the ensemble has become 
time-independent and the system has 
attained equilibrium (see Fig. 2.4).+ 
This time-independent situation is, of 
course, particularly simple. Indeed, the 
problem of the gas of N molecules then 
becomes analogous to the previously dis- 
cussed problem of the set of N coins. In 
particular, the probability p of finding 2 
molecule in the left half of the box is 
analogous to the probability p that a 
tossed coin shows a head; similarly, the 
probability ῳ of finding a molecule in the 
right half of the box is analogous to the 
probability q that a coin shows a tail. 
Furthermore, as in the case of the coin, 
these probabilities are time-independent 
and such that p = q = ὁ. 


+ Note that, despite the erratic fluctuations occurring in any single system as time goes 
on, the probability P in the ensemble at any one instant of time has always a unique well- 
defined value since the number of systems in the ensemble is arbitrarily large. This 
remark illustrates the great conceptual simplicity of thinking in terms of ensembles rather 
than in terms of a single system. 
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Remark 


In Eq. 1.4 of Chap. 1 we calculated prob- 
abilities by considering only a single sys- 
tem. This is ordinarily a valid procedure 
in the special case of a system in equi- 
librium. Since an ensemble of such 
systems is time-independent, a large 
number of successive observations on a 
single system is then equivalent to a 
large number of simultaneous observa- 
tions on the many systems of the ensem- 


takes a film strip of a single system over 

a time + and cuts it into 7 long pieces, 

each one of duration 73=7/M (where ~ 
τῷ is so long that the behavior of the sys- 
tem in one piece is independent of its 
behavior in an adjacent piece). Then 
the collection of these ¥ film strips of a 
single system is indistinguishable from a 
collection of ¥ film strips taken of all the 
systems in the ensemble during any time 


ble. In other words, suppose that one _ interval of duration 71. 


2.2 Elementary Relations among Probabilities 

Probabilities satisfy some simple relations which are almost self- 
evident, but quite important. It will be worth deriving these relations 
by starting directly from the definition (1) of a probability. Through- 
out the following discussion the number /Y of systems in the ensemble 
will always be understood to be infinitely large. 

Suppose that experiments on some system A can lead to any of a 
mutually exclusive outcomes. Let us label each such outcome or 
event by some index στ; the index r can then refer to any of the a num- 
bers r = 1,2,3,..., or a. In an ensemble of similar systems, 7; of 
them will be found to exhibit event 1, 42 of them event 2,..., and 
4, of them event a. Since these a events are mutually exclusive and 
exhaust all possibilities, it follows that 


Nyt Not --- + Nea. 
Dividing by 1, this relation becomes 
Pp t+ Po+---+Pe=1 (2) 
where P,=4,/N denotes the probability of occurrence of event r 
in accordance with the definition (1). The relation (2), which states 
merely that the sum of all possible probabilities adds up to unity, is 
called the normalization condition for probabilities. Using the 


summation symbol = defined in (M.1), this relation can also be written 
more compactly as - Ξ 


Σ», Ξτ 8 


What is the probability of occurrence of either event r or event s? 
There are «77, systems in the ensemble exhibiting event r and 1, sys- 
tems exhibiting event 8. Hence there are (A, + ‘s) systems exhibit- 
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ing either event r or event s. Correspondingly the probability P(r or s) 
of occurrence of either alternative, event r or s, is simply given by 


P(r or 5) = 


so that 


Example 


Suppose that we consider the tossing of 
a die which, by virtue of its symmetry, 
has equal probabilities 4 of landing with 
any of its six faces uppermost. The 
probability that the die exhibits the num- 
ber 1 is then ἢ, as is the probability that 


Net Ms 


N 


that the die exhibits either the number 1 
or the number 2 is then, by (4), simply 
$+4=4. Thisis, of course, an obvious 
result since the events where 1 or 2 are 
uppermost represent one-third of all the 
six possible events 1, 2, 3, 4, 5, or 6. 


it exhibits the number 2. The probability 


The relation (4) is immediately generalizable to more than two 
alternative events. Thus the probability of occurrence of either one 
of several events is simply equal to the sum of their respective proba- 
bilities. In particular, we see that the normalization condition (2) 
merely states the obvious result that the sum of the probabilities on 
the left (ie., the probabilities of occurrence of either event 1 or event 
2... or event a) is simply equal to unity (i.e., equivalent to certainty) 
since all possible events are exhausted in the enumeration of the a pos- 
sible alternatives. 


Joint probabilities 

Suppose that the system under consideration can exhibit two different 
types of events, namely a possible events of a type labeled by r (where 
the index r = 1, 2,3,..., a) and B possible events of a type labeled 
by s (where the index s = 1, 2,3,..., 8). We shall denote by P,, the 
probability of joint occurrence of both event r and event s. That is, 
in an ensemble consisting of a large number of similar systems, Ws 
of these are characterized by the joint occurrence of both an event r 
of the first type and an event 5 of the second type. Then P,,= Ays/M. 
As usual, we denote by P, the probability of occurrence of an event r 
(irrespective of the occurrence of events of type s). That is, if in the 
previous ensemble one pays no attention to events of type s and counts 
“, systems in the ensemble exhibiting event r, then P,=1,/1. 
Similarly, we denote by P, the probability of occurrence of an event s 
(irrespective of the occurrence of events of type r). 
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A special, but important, case is that where the probability of occur- 
rence of an event of type s is unaffected by the occurrence or nonoc- 
currence of an event of type τ. The events of type r and 5 are then 
said to be statistically independent or uncorrelated. Now consider in 
the ensemble the 1, systems which exhibit any particular event r. 
Irrespective of the particular value of τ, a fraction P, of these systems 
will then also exhibit the event s. Thus the number 1; of systems 
exhibiting jointly both r and s is simply 


Ny = NPs 


Correspondingly the probability of joint occurrence of both r and s is 
given by 


‘Hence we conclude that 


if the events rand 5 are statistically independent, 


Pi = Ῥ,Ρ,. 


Note that the result (5) is not true if the events r and s are not statis- 
tically independent, The relation (5) can be immediately generalized; 
thus the joint probability of more than two statistically independent 


events is simply the product of their respective probabilities. 


Exanipie 


Suppose that the system A under cor: 
sideration consists of two aice A; and Ap. 
An event of type r might be the eppear- 
ance uppermost of ny one of the 6 faces 
of the die Αι; similarly an event of types 
might be the appearance uppermost of 
any one of the 6 faces of die Ay. An 
event of system A is then specified by 
stating which face of die A, is uppermost 
and which face of die A» is uppermost. 
An experiment involving the tossing of 
the two dice would thus have 6 x 6 = 36 
possible outcomes. Probability state- 
ments can be mede about an ensemble 
consisting cf a large number Jf of similar 
pairs of dice. We suppose that each die 
is symmetric so that it is equally likely to 
ind with any ene of its Ὁ faces upper- 


most. The probability P, that each die 
lands with any particular fece r upper- 
most is then simply }.  !f the dice do not 
interact with each otner {e.g., if they are| 
not both magnetized so as to exert forces 
tending to align cach other) and are not 
tossed in precisely identical ways, they 
can be considered statistically independ-| 
ent. in this case the joint probability 
P,, that die A; lends with a particular face 
τ uppermost and that die 4: lands with a 
particular face s uppermost is simply 


Pa=PP=txt=h 


This result is, of course, quite evident 
since the contempizied event represents 
one out of 6 x 5 = 36 possible out- 
comes. 
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3.3 The Binomial Distribution 
We have now acquired sufficient familiarity with statistical methods 
to give a quantitative discussion of some physically important prob- 
lems. Consider, for instance, an ideal system of N spins }, each having 
an associated magnetic moment fg. This system is of particular in- 
terest since it is a very simple system readily described in terms of 
quantum mechanics: it will therefore often be useful as a prototype of 
more complicated systems. Yor the sake of generality, we suppose 
that the spin system is located in an external magnetic field B. Each 
magnetic moment can then point either “up” (i.e., parallel to field B), 
x “down” {i.e,, antiparallel to the field B). We assume that the 
spin system is in equilibriun. A statistical ensemble consisting of “7 
such spin systems is thus time-independent. Focusing attention on 
any one spin, we shall denote by p the probability that its magnetic 
moment points up and by q the probability that it points down. Since 
these two orientations exhaust all the possibilities, the normalization 
requirement (3) has the obvious consequence that 


p+q=l ©) 


orq=1—p. In the absence of the field, when B = 0, there is no 
preferred direction in space so that p = q = }. But in the presence 
of the field, a magnetic moment will be more likely to point along the 
field than opposite to it so that p > q.} Since the spin system is ideal, 
the interaction between the spins is almost negligible so that their ori- 
entations can be considered statistically independent. The probability 
that any given moment points up is, therefore, unaffected by whether 
any other moment in the system points up or down. 

Among the N magnetic moments of the spin system, let us denote 
by n the number of magnetic moments which’ point up and by n’ the 
number of magnetic moments which point down. Of course, 


n+n=N (7) 


so that n’ = N—n, Consider then the spin systems in the statistical 
ensemble. The number n of moments pointing up is not the same in 
each system, but can assume any of the possible values n = 0, 1,2, 

.,N. The question of interest is then the following: For each pos- 


+ We shall consider p and q as given quantities deteumined by experiment. In Chap. 4 
we shall learn how to calculate p and q for any Value of B if the spin system is Seen to 
be at ἃ given: temperature. 


Fig. 2.5 A system consisting of N spins ὁ in 
the special case where N= 4. Each arrow 
indicates the direction of the magnetic moment 
of a spin. The external magnetic field is de- 
noted by B. 


Table 3.1. A table T which lists all the possi- 
ble orientations of N magnetic moments in 
the special case where N = 4. The letter U 
indicates a moment which points up, the letter 
Τὸ ἃ moment which points down. The num- 
ber of moments pointing up is denoted by n, 

_ the uumber pointing down by π΄. The num- 
ber C(x) of possible configurations in which 
n of the N moments point up is indicated in the 
last column. (Note that this table is analogous 
to Table 1.3.) 
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sible value of n, what is the probability P(n) that n of the N magnetic 
moments point up? 

The problem of finding the probability P(n) is readily solved by the 
following argument. The probability that any one magnetic moment 
points up is p, the probability that it points down is q = 1 — p. Since 
all the magnetic moments are statistically independent, the general 
relation (5) allows us immediately to say that 


the probability of occurrence of 

one particular configuration = naa gt 
where n moments point up and | — ῬΈΣΞΕΡ 2S Bias ©) 
the remaining n’ point down nfactors π' factors 


But a situation where any n moments point up can ordinarily be 
achieved in many «ternative ways, as is illustrated in Table 2.1. Let 
us, therefore, introduce the notation 


Cyn) = the number of distinct configurations of N moments 

where any ἡ of these moments point up (and the (9) 

remaining n’ point down).t 
The desired probability P(n) that n of the N moments point up is equal 
to the probability that either the first, or the second, ..., or the last 
of the C;{n) alternative possibilities is realized. In accordance with 
the general relation (4), the probability P(n) is thus obtained by sum- 
ming the probability (8) over the Cx(n) possible configurations having 
n moments pointing up, ie., by multiplying the probability (8) by 
Cyn). Thus we get 

P{n) = Cy(n)prgi- (10) 


where we have put n’ = N — π΄ 

It only remains to calculate the number of configurations Cy(n) in 
the general case of arbitrary N and σι. Suppose then that we consider 
a table T, similar to Table 2.1, in which we list all the possible configu- 
rations of the N moments and designate by U each moment which 
points up and by D each moment which points down. The number 
Gx(n) is then the swsber of entries in which the letter U appears n 
times. To examine how many such entries there are, let us consider 
n distinct moments which point up and label them by the letters 
Uy, Us,..., Uy. In how many avays can we list these in a table T’ 
(such as Table 2.2 shown for the special case where N = 4 and n = 2)? 


+ The number Cyin) is sometimes called the number of combinations of N objects taken 
n ata time. 
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The letter U; can be listed in a line of the table in any 
one of N different places; 

for each possible location of Uj, the letter U2 can then 
be listed in any one of the remaining (N — 1) places; 

for each possible location of Uy and Us, the letter Us can 
then be listed in any one of the remaining (N — 2) 
places; 


for each possible location of U3, U2, ..., Uy-1, the last 
letter U, can then be listed in any one of the remain- 
ing (Ν — n + 1) places. 


The possible number J(n) of distinct entries in the table T’ is then 
obtained by multiplying together the number of possible locations of 
the letters Uy, Us, ..., Un; ie., 


Jx{n) = N(N — 1)(N — 2) ++» (N—n 4+ 1). 
This can be written more compactly in terms of factorials. Thust 


(11) 


i») — ΝΙΝ — ἈΝ -- 3) --- (N—n + 1)(N—n)--- () 
Le eta ge ---τ 

N!} 9 

“qr (12) 


The symbols Uj, Uz, ... , Un were considered distinct in the preceding 
enumeration, while the subscripts are reully irrelevant since all mo- 
ments pointing up are equivalent; i.e., any U; denotes a moment point- 
ing up, irrespective of i. Those entries in the table T’ which differ 
only by a permutation of the subscripts correspond, therefore, to 
physically indistinguishable situations (see, for example, Table 2.2). 
Since the possible number of permutations of the n subscripts is given 
by n!, the table T’ contains n! times too many entries if only distinct 
nonequivalent entries are to be considered.{ The desired number 
Cx(n) of distinct configurations of up and down moments is then given 
by dividing Jy{n) by πὶ so that 


Jxin) N! 


ale le i (13) 


Cyn) = 


+ By definition, N!== ΝΙΝ — 1)(N — 2) - -- (1); in addition, 0! = 1. 

{The first subscript can assume any of n possible values, the second subscript any of 
the remaining (n — 1) possible values, ..., and the nth subscript the one remaining value. 
Hence the subscripts can be arranged in n(n — 1) --- (1)=n! possible ways. 


ὮΣ pba nef = ee 
Table 2.2. A table T’ which lists al! the possi- 
ble arrangements of N = 4 identical moments, 
n = 2of which point up. For ease of enumer- 
ation, these two moments have been labeled by 
U; and Us, respectively, although they are 
physically indistinguishable. Hence entries 
which differ only in subscripts are equivalent. 
Such equivalent entries are indicated by identi- 
cal letters in the last column. The table con- 
tains thus n! = 2 times too many entries if 
one is interested only in entries which are 
physically distinct. 


E -4 2 0 2 4 m 
Fig. 2.6 Binominal distribution for N = 4 
magnetic moments when Ὁ τε 4 τε ὁ. The 
graph shows the probability P(n) that n mo- 
ments point up, or equivalently, the probability 
Pm) that the total magnetic moment in the 
up direction is equal to m (when measured in 
units of po). 


aga ἢ 
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The desired probability (10) then becomes 


ΝΙ 


Ν. 

Pi segs (14) 
or, in more symmetrical form, 

P(n) = ee where n’ =N—n. (15) 

In the special case 

eee = Nia 
when p= q =}, P(n) = ain () ; (16) 
For a given number N, the probability P(n) is a function of n and is 
called the binomial distribution. 
Remark 


In expanding a binomial of the form 
(p +)", the coefficient of the term 
prqh-" is simply equal to the number 
Ὁμ(η) of possible terms which involve the 
factor p exactly n times and the factor q 
exactly (Ν — n) times. Hence one οὔ: 
tains the purely mathematical result 
known as the binomial theorem 


N 
ΦΈΦΝ =Daweaer an 


Comparison with (14) shows that each 


Discussion 


term on the right is precisely the proba- 
bility P(n). This is the reason for the 
name ‘binomial distribution.” Inciden- 
tally, since p + q = 1 when p and q are 
the probabilities of interest to us, Eq. (17) 
is equivalent to 


N 
1= D/P). 
=0 


This verifies that the sum of the proba: 
bilities for all possible values of n is prop- 
erly equal to unity, as required by the 
normalization condition (3). 


To examine the dependence of P(n) on n, we investigate first the 
behavior of the coefficient Cyx(n) given by (13). Note first that Cy{n) 
is symmetric under interchange of nandN—n=n’'. Thus 


In addition, - 


We also note that 


Cy(n) = Cx{n). (18) 
- Cx(0) = CN) = 1. (a9) 
Cin +1) _ n\(N — n)! Ν --π (20) 


Cy{n) 


=GaeDiw—n—-Dl n+l 
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Starting with n = 0, the ratio of successive coefficients Cy(n) is thus 
* initially large, of the order of N; it then decreases monotonically with 
n, staying larger than (or at most becoming equal to) unity as long as 
n « ἘΝ, and becoming smaller than unity for n > ἐν. This behavior, 
combined with (19), shows that Cy(n) has a maximum near n = ἐν 
and that its value there is very large compared to unity, to the extent 
that N is large. 
The behavior of the probability P(n) is then apparent. By (16) it 
follows that, 


ifp=q=}, P(n') = P(n). (21) 


This result must, of course, be true by symmetry since there is no pre- 
ferred spatial orientation if p = q (1.6., in the absence of an applied 
magnetic field B). In this case the probability P(n) has a maximumt 
++ nearn=4N. On the other hand, if p > 4, the coefficient Cy(n) still 
tends to produce a maximum of P(n), but this maximum is now shifted 
toa value n >4N. Figures 2.6 and 2.7 illustrate the behavior of the 
probability P(n) in some simple cases. 


{The maximum is at $V when N is even, and straddles this value otherwise. 


Fig, 3.7 Binominal distribution for N = 20 
magnetic moments when p= ᾧ τὸ ὁ, The 
graph shows the probability P(n) that n mo- 
ments point up, or equivalently, the probability 
Pm) that the total magnetic moment in the 
up direction is equal to m (when measured in 
units of μο). 
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Fig. 2.8 The probability P’(m) 
that the total magnetic moment 
of a system of N spins καὶ is equal 
to πὶ (measured in units of μο). 
Because of the presence of an 
applied magnetic field, p = 0.7 
and ῳ = 0.3. The graphs show 


P’(m) in four different cases cor- 


responding to N = 10, N = 20, 
N = 30, and N = 50. 
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The total magnetic moment of a spin system is a quantity which is 
readily measured experimentally. Let us denote by M the total mag- 
netic moment along the “up” direction. Since M is simply equal to 
the algebraic sum of the components along the “up” direction of the 
magnetic moments of all the N spins, it follows that 


M = πμο — πίμο = (n — πΎΊμο 
or M= mp (22) 
where πὶ ΞΞΞ πὶ -- π' (23) 
and where po denotes the magnitude of the magnetic moment of a 
spin. By (22), m = M/yo is simply the total magnetic moment meas- 
ured in units of jo. The relation (23) can also be written as 
m=n—n'=n—(N—n)=2n-N. (24) 


This shows, incidentally, that the possible values of m must be odd if 
N is odd, and must be even when N is even. According to (24), a defi- 
nite value of n corresponds to a unique value of m, and conversely 


n=i(N +m). (25) 


The probability P’(m) that m assumes a certain value must then be the 
same as the probability P(n) that n assumes the corresponding value 
given by (25). Thus 


P(m) = (2 5). (26) 


This expression gives the probability of occurrence of any possible 
value of the total magnetic moment of the spin system. In the special 
case where p = q =}, the expressions (16) and (26) yield thus 
explicitly 

ΝΙ ( ‘1 y's 
Ἐξ}. 5): ay” 


2 2 


P(m) = 


the most probable situation is then clearly at (or near) m = 0 where 
δῖ τε 0: 


Generality of the binomial distribution 

Although our discussion has dealt with the specific problem of a sys- 

tem of spins, it can be phrased more abstractly. Thus we have really 

solved the following general problem: Given N events which are sta- 
+ tistically independent. Suppose that each such event occurs with a 
- probability p; the probability that it does not occur is thus given by 
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q=1-—p. What then is the probability P(n) that any n out of these 
N events do occur (while the remaining n = N — n events do not 
occur)? This question is immediately answered by the binomial dis- 
tribution (14). Indeed, in our specific example of a system of N inde- 
pendent spins, the occurrence of an event is merely represented by a 
spin pointing up, while the nonoccurrence of an event is represented 
by a spin not pointing up, ie., pointing down. 

Some further examples will serve to illustrate some common prob- 
lems which are immediately answered by the binomial distribution. 


Ideal gas of N molecules 


Consider an ideal gas of N molecules en- 
closed in a box of volume Vo. Since the 
molecules of an ideal gas interact with 
each other to an almost negligible extent, 
their motion is statistically independent. 
Suppose that the box is imagined to be 
subdivided into two parts of respective 
volumes V and V’, where 


V+V'=Vo 7) 


Consider an ensemble of many such 
boxes of gas. Let p denote the proba- 
bility that a given molecule is found in 
the volume V, and q the probability that 
it is found in the remaining volume γ΄. 
If the gas is in equilibrium, each mole- 
cule tends to be uniformly distributed 


Tossing of coins or dice 


Consider the tossing of a set of N coins. 
The behavior of these coins can be con- 
sidered statistically independent. Let p 
denote the probability that any given coin 
shows a head and q the probability that 
it shows a tail. By symmetry, we can as- 
sume that p = 4. What then is the 
probability P(n) that n of the N coins 
show heads? 

The tossing of a set of N dice is similar. 
These again can be considered statsti- 


throughout the box so that 


and (28) 


Thus p + q=1, 2s in (6). What then 
is the probability P(n) in the ensemble 
that n of the N molecules are found in 
the volume V (while the remaining π' 
N—n molecules are found in the vol- 
ume ΝΎ The answer is given by the 
binomial distribution (14). In particular, 
if V = V’sothat p = q = }, we have thus 
solved explicitly the problem of Sec. 1.1 
where we wanted to find the probability 
that n out of N molecules are found in the 
left half of a box. 


cally independent. Let "p denote the 
probability that any given die lands with 
a “6" uppermost and q=1-—p the 
probability that it does not. Since a die 
has 6 faces, we can assume by symmetry 
that p =} while q=1—p=4. What 
then is the probability P(n) that n of the 
Ν dice show a "6" uppermost? This 
question again is answered by the bino- 
mial distribution (14). 
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3.4 Mean Values 
Suppose that a variable u of some system can assume any of a possible 
distinct values 


Uy, U2,--., Ue 
with respective probabilities 


Pi, Po, .+ +> Pa 


This means that, in an ensemble of similar systems (where Y > 0), 
the variable u assumes the particular value u, in a number Y, = AP, 
of these systems. 

The specification of the probabilities P, for all the a possible values 
u, of the variable u constitutes the most complete statistical descrip- 
tion of the system. It is, however, also convenient to define parame- 
ters that characterize in a less detailed way the distribution of the 
possible values of u in the ensemble. These parameters are certain 
mean (or average) values. The notion is quite familiar. For example, 
the result of an examination given to a group of students can be de- 
scribed most completely (if one does not wish to name individual stu- 
dents) by specifying the number of students receiving each of the 
possible grades given on this examination. But the result can also be 
characterized in a less detailed manner by computing the mean grade 
of the students. This is conventionally done by multiplying each pos- 
sible grade by the number of students receiving this grade, adding all 
the resultant products, and then dividing this sum by the total number 
of students. In a similar manner, the mean value of u in the ensem- 
ble is defined by multiplying each possible value u, by the number J’, 
of systems in the ensemble which exhibit this value, adding the result- 
ant products for all α possible values of the variable u, and then divid- 
ing this sum by the total number ¥ of systems in the ensemble. The 
mean value of u (or ensemble average of u), which we shall denote by 
ii, is thus defined by 


Sra. 


—— Ayu; + «γι + «++ + Mat, _ ral 
i= ra SS (29) 
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But, since V’,/4 =P, is the probability of occurrence of the value u,, 
the definition (29) becomes simplyt 


(30) 


Similarly, if f(u) is any function of τι, the mean calue (or ensemble 
average) of f is defined by the expression 


(31) 


This definition implies that mean values have some very simple 
properties. For example, if f (u) and g(u) are any two functions of u, 


Fr e=D Pf) + aud = S rif) ἘΣ Palen) 


This result shows quite generally that the mean value of a sum of terms 
is equal to the sum of the mean values of these terms. Thus the suc- 
cessive operations of performing a sum and taking an average yield 
the same result irrespective of the order in which they are carried out.} 
Similarly, if c is any constant, 


f= > Prlof(ts)) = eS Pfu) 


or 


(33) 


Thus the operations of multiplying by a constant and taking an aver- 
age can also be carried out in either order without affecting the result. 
If f = 1, the relation (33) makes the obvious assertion that the mean 
value of a constant is simply equal to this constant. 


} The mean value @ is time-dependent if the ensemble is time-dependent, ie., if some 
of the probabilities Ρ, depend on times Note also that the mean value or ensemble aver- 
age ii is an average over all systems in the ensemble at a particular time. It is ordinarily 
different from the time average defined in (1.6) for ἃ single system, except in the special 
case of time-independent ensembles where the time average is taken over ἃ very long 
interval of time. 


{In mathematical language one would say that these operations “commute.” 
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Example 


Consider a system of 4 spins when 
p= q=4. The number of moments 
pointing up can then be n = 0, 1, 2, 3, 
These numbers occur with probabilities 
P(n) which follow immediately from (16) 
and which were already calculated very 
simply in(1.4a). As indicated in Fig. 2.6, 
these probabilities are respectively, 


Ῥίη) = Ye, vs, Yes Ye: ve 


The mean number of moments pointing 
up is then 


4 
a= > Pn)n 


n=0 
= (Pe X 0) + (Ye x 1) + (fe x 2) 


+ (te X 3) + (ts x 4) 
=2 


Note that this result is simply equal to 
Np=4x}4. 

Since p = 4, there is no preferred di- 
rection in space, The mean numberof 
moments pointing down must thus be 
equal to the mean number of moments 
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pointing up, i.e., 
n=n=2 


This result also follows from the relation 
(32) which allows us to write 


w=N—-n=N 


=4-2=2 


Since there is no preferred direction in 
space, the mean magnetic moment must 
clearly vanish. Indeed, one has 


m n-w=2-2=0, 


The value of m could also be computed 
directly, of course, by using the proba- 
bility P’(m) that m assumes its possible 
values m = —4, —2,0, 2,4. Thus one 
has, by definition, 


m= S} P(m)m 


= [re Χ (—4)] + [τὸ x (-2)] 
+ [i x 0) + [re Χ 2) + [rs x 4] 


50. 


One last property of mean values is often of importance. Suppose 


that one is dealing with two variables, u and v, which can assume the 


values 


Uy, U2,.+-5Ue 


and 01, U2, +++ Up, 


respectively. Let us denote by P, the probability that u assumes the 
value u,, and by P, the probability that Ὁ assumes the value v,. If 
the probability that u assumes any of its values is independent of the 
value assumed by ἡ (i.e., if the variables u and v are statistically inde- 
pendent), then the joint probability P,, that u assumes the value u, and 
that Ὁ assumes the value Ὁ, is, by (5), simply equal to 


Prs = P,Ps. (34) 


Suppose now that f(u) is any function of τὶ while g(v) is any function 
of v. Then the mean value of the product fg is, by the definition (31), 
quite generally given by 


Fuge) 


a β 
Σ DPref (terlelvs) 


r=1s=1 


(35) 
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where the summation is over all possible values τι, and v, of the vari- 
ables. If the variables are statistically independent so that (34) is valid, 
Eg. (35) becomes 


Fe=D VPP F(metor 
=D DE FellPaetod 
= [Drsemn][SPeo- 


But the first of the factors on the right is simply the mean value of f, 
while the second is the mean value of g. Hence we arrive at the 
result that 


23 if u and v are statistically independent, 
Je = fe: 
ie., the average of a product is then simply equal to the product of 
the averages. 


(36) 


Dispersion 

Suppose that a variable u assumes its possible values u, with respective 
probabilities P,, Some general features of the probability distribution 
can then be characterized by a few useful parameters. One of these 
is simply the mean value of u itself, i.e., the quantity « defined in (30). 
This parameter indicates the central value of u about which the various 
values τι, are distributed. It is then often convenient to measure the 
possible values of u with respect to their mean value ἃ by putting 


Au=u-t (37) 
where Au is the deviation of u from the mean value δ. Note that the 
mean value of this deviation vanishes. Indeed, using the property (32), 

Δα = (ἃ -- ἢ) -- ἃ -- ἃ -- Ο. (38) 


It is also useful to-define a parameter which measures the extent of 
the spread of the possible values of u about their mean value i. The 
mean value of Au itself does not provide such a measure since Au is, 
on the average, as often positive as negative so that its mean value 
vanishes in accordance with (38). On the other hand, the quantity 
(Au)? can never be negative. Its mean value, defined by 


Gy = DP (auyr= S Plu -ὰρ (39) 
T= r= 


“ 
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is called the dispersion (or variance) of u and also can never be nega- 
tive since each term in the sum of (39) is nonnegative.t Thus 


(Aup > 0. (40) 


The dispersion can only vanish if all occurring values of u, are equal 
to δὲ; it becomes increasingly large to the extent that these values have 
appreciable probability of occurring far from i. The dispersion does, 
therefore, provide a convenient measure of the amount of scatter of 


~the values assumed by τι. 


The dispersion (Au)? is a quantity which has the dimensions of the 
square of u. A linear measure of the spread of possible values of u 
is provided by the square root of the dispersion, 1.6., by the quantity 


(au) (41) 


which has the same dimensions as uw itself and which is called the 
standard deviation of u. The definition (39) shows that even a few 
values of u occurring with appreciable probability far from ἃ would 
make a large contribution to Au. Most values of u must therefore 
occur within a range of the order of Au surrounding the mean value i. 


Example 
Let us return to the previous example οἱἑἑ  [(Δπη Ξε δ᾽ P(m)(m — 0)? 
four spins when p = q =4. Sincen = 2, m 
the dispersion of n is, by definition, = [ts x (—4)"] + bite x (—2)3] 
οὔτε + bie x (0)7] + fe x (2)"] 
An}? = δ᾽ P{n)(n — 2)2 
Ὁ Σ siemens + [ns x (4)7] 
= [ee x (-2)9] + he x ( 4] Pee 
50 that 
+ hie x (0)7] + [τς x ΟΣ] Am= V4=2. 


Let us verify that the preceding results 


+h xe are consistent. Since m= Ὁ while i = 
af i’ = 2, one has for all values of m or n 
Δπι τε πὶ τε πὶ -- π' =n—(4—n) 
Hence the standard deviation οἵ n is ae ΞΟ δῶν 
An= VI=1 or Am = 2(n — ii) = 2An. 


Ga = ae 
Simitarly one can calculate the dispersion ence (Am)? = 4(An)?, 


of the magnetic moment. Since m= 0, ἱπ agreement with what we found by ex- 
‘one has, by definition, plicit calculation. 


+ Note that (Au)? is different from the quantity (Δ): i.e., it makes a great deal of differ- 
ence whether one squares first and then takes the average, or whether one carries out these 
operations in the reverse order. 
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A knowledge of the probabilities P, for all values u, gives complete 
statistical information about the distribution of the values of u in the | 
ensemble. On the other hand, a knowledge of a few mean values such 
as 7 and (Au)? provides only a partial knowledge of the characteristics 
of this distribution and is not sufficient to determine the probabilities 
P, unambiguously, Such mean values, however, may often be calcu- 
lated very simply without an explicit knowledge of the probabilities, 
even in cases where the actual computation of these probabilities would 
be a difficult task, We shall illustrate these remarks in the next section. 


2.5 Calculation of Mean Values for a Spin System 
Consider an ideal system of N spins 3. The fact that these spins are 
statistically independent allows us to calculate various mean values 
quite simply under very general conditions. The calculation can be 
accomplished without the need of computing any probabilities such 
as the probability P(n) obtained in (14). 

Let us start from scratch then to investigate a physically interesting 
guantity of this spin system, its total magnetic moment M along the 


| up direction. Let us denote by μὲ the component along the up direc- 


tion of the ith spin. The total magnetic moment Mis then simply equal 
to the sum of the magnetic moments of all the spins so that 


Με μι ὁ pets + by 


or, in more compact notation, 
N 
M= = μι- (42) 
ἵξι 


We should like to calculate the mean value aud the dispersion of this 
total magnetic moment. 

To compute the mean value of M, we need only take the mean values 
of both sides of (42). The general property (32), which allows us to 
interchange the order of averaging and summing, immediately yields 
the result 


(43) 


But the probability that any magnetic moment has a given orientation 
(either up “or down) is the same for each moment; hence the mean 
magnetic moment is the same for each spin (i.e., fa = fe = --- = fy) 
and can be denoted simply by #. The sum in (43) consists, therefore, 
of N equal terms so that (43) becomes simply 
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This result is almost self-evident; it asserts merely that the mean total 
magnetic moment of N spins is N times as large as the mean moment 
of one spin. 

(2) Now let us calculate the’ dispersion of M, i.e., the quantity (AM)? 
where 


AM=M — ΜΝ. (45) 
Subtraction of (43) from (42) yields 


or (46) 


where 


(47) 
"To find (AM )?, we need only multiply the sum in (46) by itself. Thus 
(AM)? = (Apa + Ape +--+ + Apy) (Apa + Apa + +++ + Apy) 
= [(Δμιὴ)Σ + (Awe)? + (Apa)? + --- 4 (Apx)?] 
+ [Apa Ape + Apts Aug + +++ + Ap Any 
+ Ape Aur + Ape Aus + +++ + Apy Δμν. 4] 


N N ON 
or (AM)? =>" (Am)? +5) S)(Am)(Au). (48) 
ἐξ Ἐπ Ly j= 1 
The first term on the right represents all the squared terms arising 
from terms in the sum (46) which are multiplied by themselves; the 
τ΄ second term represents all the cross terms arising from products of 
different terms in the sum (46), Taking the mean value of (48) and 
using again the property (32) which allows us to interchange the order 
of averaging and summing, we then obtain 


N «ΟἿ. 
(ame Ξ Gu? +>) >} Gan). (49) 


ἵξι j= 
iy 


All products in the second sum, where i > j, refer to different spins. 
* | But since different spins are statistically independent, the property 
(36) implies that the mean value of each such product is simply equal 

| to the product of the mean values of its factors. Thus, 


for ij, (Δμηζδμὴ = (Δμιγ(δμὴ = 0 (50) 
since δμι = mG —p=0. 
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In shoft, each cross term in (49) vanishes on the avetage, being as often 
positive as negative. Hence (50) reduces simply to a sum of squared 
terms (none of which can be negative): 


N 
(AM)? = Σ Gui? (5) 

iz 
‘+ The argument now becomes identical to that Following Eq. (43). The 
_probability that any moment has any given orientation is the same for 
— moment; hence the pei (Ap;)? is the same for each spin 
lie., | (Qin? = Gin? = --+ = Guy)*] and can be denoted simply by 
(Gn). The sum jn (51) εὐθβνα thus of N equal terms and reduces 

simply to 


(MF = N@HE. (62) 


| This relation asserts that the dispersion of the total magnetic moment 


is merely N times as large as the dispersion of the magnetic moment 
of an individual spin. Correspondingly (52) also implies that 


AM = VN Ap (53) 
where AM= [(AM)?}1/2 and Ap = [{Δμ)3}.5 


are, in accordance with the general definition (41), the standard devia- 
tions of the total magnetic moment and of the magnetic moment per 
spin, respectively. 

_ _ The relations (44) and (53) show explicitly how M and AM depend 
on the total number N of spins in the system. When ji τέ 0, the mean 
total magnetic moment M increases proportionately to N. The stand- 

+ ard deviation AM (which measures the width of the distribution of 
values of M about their mean value Δ) also increases as N increases, 
but does so only proportionately to N/*, Hence the relative magni- 
tude of AM compared to M decreases proportionately to N~1/2; jpdeed 
(44) and (53) imply that 

\—> Ὰ 


for ji # 0, = too dM (δὴ 


Figure 2.8 illustrates these characteristic trends. 


(3. Note that the results (44) and (53) are very general. They depend 


only on the additive relation (43) and on the fact that the spins are 
statistically independent. All our considerations would thus remain 
equally valid even if the component μὲ of each magnetic moment 


-- 


ε 
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could assume many possible values. (This would be the case if the 
spin of each particle were gréater than } so that it could exhibit more 
than two possible orientations in space.) 


System of particles with spin 4 

The foregoing results are readily applied to the familiar, special ease] 
where each particle has spin 3. As usual we suppose that its magnetic 
moment then has probability p of pointing up so that μ᾽ = μο, and 
probability q = 1 — p of pointing down so that py; = —po. Its mean 
moment along the up direction is thus 


B= poo + α(-- μοὶ = (P— 4)μο = (2p — 1)po. (65) 
As a check we note that, in the symmetrical case where p = q, ji = Ὁ 
as expected. 
The dispersion of the magnetic moment of a spin is given by 
Gu? = — BP = ρίμο — μ)5 + φί--μο — μ). (58) 
But Ho — αὶ = Ho — 2p — 1)to = Ὡμο(! — p) = 20g, 
and fo + 2 = Ho + 2p — Ἰ)μὸ = Qpop. 


Thus (56) becomes 


(Gu? = ρίϑμοφ)" + g(2nap)? = 4uo?palg + p) 
or (Ap)? = 4pquo? (57) 


sincep +q=1. 
The relations (44) and (52) yield therefore the results 


M=Np— qo (58) 


and (AM)? = 4Npq μοῦ. (59) 
The standard deviation of M is accordingly 

AM = 2\/Npq μο. (60) 

If we write M = myo, sa that the integer m = M/jo expresses the 

total magnetic moment in units of μο, the results (58) through (60) can 

also be expressed in the form 

πὶ =Np — 4) =N@p — ἢ), (61) 

(Am? —4Npq, (62) 

ho =25/¥pq. (6) 
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These relations contain an appreciable amount of information about 
the distribution of the possible values of M or m in the ensemble of 
spin systems. Thus we know that only those values of m occur with 
appreciable probability which lie near m and do not differ from it by 
an amount much larger than Am. Figure 2.8 provides a specific 


illustration. 


Example 


Suppose that, in the presence of 8 certain 
applied magnetic field B, the magnetic 
moment of each spin has a probability 
p = 0.51 of pointing parafiel to B and a 
probability q = 1 — p = 0.49 of pointing 
antiparallel to B. The mean total mag- 
netic moment of 8 system of N spins is 
then 


M = 0.02Nj0. 


The standard deviation of its total mag- 
netic moment is given by (60) so that 


AM = 2\/Npq po = VNuo- 


| M=210" 44) 
_ AM=10"y, 


Ξ Hence 4M Vino _ _50_ 
‘M ~ 0.02Nuo VN 


M 


0 Ξ M 
2AM 


‘st Consider first a case where the total 

number of spins is fairly small. For ex- 
ample, suppose that N= 100. Then 

Fig, 2.9 The probability P”(M) that the total = 

magnetic moment of a spin system has a value ἊΜ 88 

M when N = 100 and when N = 1054, The Μ᾿ νοῦ 

magnetic field is such that Ὁ τ 0.51 and + sothatAM>M. The scatter in the pos- 

q = 0.49. The graphs indicate the envelope sible values of M is then very pro- 

curves showing the possible values of P’(M). nounced. Indeed, it is quite likely that 

The two graphs are not drawn to the same there occur values of M which differ 

scale. 


ai) 


widely from M and are even of opposite 
sign. (See Fig. 2.9.) 

On the other hand, consider the case 
of 8 macroscopic system of spins where 
Nis of the order of Avogadro's number, 
say N= 10%, Then 


SM 50 sy 10 


vier 

so that ΔΜ «- Μ. The scatter in the 
possible values of M is then very small 
relative to the mean total magnetic mo- 
ment. If we set out to measure the total 
magnetic moment of the system, we 
would thus almost always measure a 
value very close to M. Indeed, unless 
our method of measurement were suffi- 
ciently precise to detect differences of 
magnetic moment smaller than about 
one part in 1019, we would virtually al- 
ways measure a magnetic moment equal 
to M without becoming aware of the ex- 
istence of fluctuations about this value. 
This example illustrates concretely the 
general conclusion that the relatice mag- 
nitude of fluctuations tends to become 
very small in a macroscopic system con- 
sisting of very many particles. 


Distribution of molecules in an ideal gas 


Consider an ideal gas of N molecules contained in a box of volume Vo. 
We are interested in investigating the number n of molecules found 
within any specified subvolume V of this box (see Fig. 2.10). If the gas 
is in equilibrium, then the probability p of finding a molecule in this 
volume V is simply equal to 


(64) 


as mentioned previously in (28). 


ἡᾷ. 


a 


ι- 


μ- 
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It is very easy to calculate the mean value of n and its dispersion. 
We have already pointed out at the end of Sec. 2.3 that the problem 
of the ideal gas is analogous to that of the spin system. (Both prob- 
lems are of the type leading to the binomial distribution.) Hence we 
can immediately apply the results (61) and (62) to find the desired 
information about n. Let n’ denote the number of molecules in the 
remaining volume Vo — V of the box and let m=n—n’. As shown 
in (25), it then follows that 


n=H(N +m). (65) 
Using the result (61) for m, we then obtain 
a= HN +m) τ ἅν Ἐρ -- ῷ 
or (66) 


since q= 1—p. Furthermore, we obtain from (65) the relation 
An=n—ni=}(N+m) —}(N+ m) = 4m --- τὴ] 
or An = 4Am. 


Any? = 3am)? 


Hence 


and (62) implies thatt 


(An? = Npg. (67) 
The standard deviation of n is then 
An = VNpq (68) 
sett NN 
so that ay 5 (2) VN" (69) 


These relations show again that the standard deviation An increases 
proportionately to N1/2. Correspondingly, the relative value An/i 
of the standard deviation decreases proportionately to Ν "192 and thus 
becomes very small when N is large. These statements are well illus- 
trated by the special case of Chap. 1 where we considered the num- 
ber n of molecules contained in one half of a box. In this case (64) 
implies that p = q = 4 so that (66) reduces to the obvious result 


ni=4N 

An I: 
vhil ==. 
as π΄ JN 


{ The relations (66) and (67) could also be derived directly by the methods of this section 
without using the corresponding results for the quantity m (see Prob. 2.14). 


Fig. 2.10 A box of volume Vo contains Ν᾽ 
molecules of an ideal gas. At any given time 
some number n of the molecules are located in 
the subvolume V, while the remaining number 
n’ = N — nare located in the rest of the vol- 
ume V’ = Vo — V. 
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+ These relations put the discussion of fluctuations in Sec. 1.1 on a quan- + 
titative basis. The fact that the absolute magnitude of fluctuations, 
(measured by An) increases with increasing N while the relative mag- 

+ nitude of fluctuations (measured by An/n) decreases with increasing 
N is illustrated explicitly by the graphs of Figs. 1.5 and 1.6 for N = 4 

» and N = 40. When the box contains about a mole of gas, N is of the 
order of Avogadro’s number so that N ~ 1024. In this case the rela- 
tive magnitude of fluctuations An/n ~ 10. 12 becomes so small as to 
be almost always negligible. 


2.6 Continuous Probability Distributions 
Let us consider an ideal spin system consisting of a large number N 


Vas Sto: vn Qh\R_ 2S of spins ὁ. There are then many possible values of the total magnetic 
moment of this system. Indeed, by (22) and (24), 


+ Doservdsls 


sean 
τὰς She en anstic 


ἘΠ YA M = muy = (2n — N)to (70) 
so that M can assume any of the (N + 1) possible values 
fuc8M, Με - Ν μα, —(N— 3)μο, —(N — 4)μο,--..(Ν — 2)0, Νμο. (71) 


+ The probability P’(M) that the total magnetic moment assumes a par- 
ticular value M is equal to the probability of occurrence of the corre- 
sponding value of m or n, i.e., to P’(m) given by (26) or P(n) given 


by (14). 
Thus - Ρ΄(ΜῚ = Pm) = P(n), (ray 
where m= # and n=4(N +m). 


Except when M is near its extreme possible values +Nyo [where 

P’(M) is negligibly small], the probability P’(M) does not vary appre- 

= ciably in going from one possible value of M to an adjacent one; i.e., 

Puy. Py ΙΡΜ + 9μο) — ΡΜ) « ΡΜ). The envelope of the possible 

Srasetf, values of P’(M) then forms a smooth curve, as indicated in Fig. 2.11. 

Thus it is possible to regard P’(M) as a smoothly varying function of 

the continuous variable M, although only the discrete values (71) of M 
are relevant. 

Suppose that μο is negligibly small compared to the smallest magnetic 
moment of interest in any macroscopic measurement. The fact that 
M can assume only discrete values separated by 2) is then unobserv- 
able within the precision of contemplated observations. Thus M can 
υ- Fixe + indeed be considered as a continuous variable. Furthermore, one 

can meaningfully talk of a range dM which is a “macroscopic infini- 


ty 


a a dah i i ἼΜῊ 


+ 
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tesimal,” i.e., a quantity which is macroscopically very small although 
it is microscopically large. (In other words, dM is supposed to be 
negligibly small compared to the smallest magnetic moment of inter- 
est in a macroscopic discussion, although it is much larger than 1.) 
The following question is then of interest: What is the probability that 
the total magnetic moment of the system lies in a particular small range 


, fbetween|M and M + dM? The magnitude of this probability depends 
" obviously on the size of the range dM and must become vanishingly 


small as dM is made negligibly small. This probability is therefore 
expected to be simply proportional to dM so that it can be written in 
the form: 


— that the total magnetic 


moment lies between M and M + om = P(M) dM (73) 


where P(M) is independent of the magnitude of dM.{ The quantity 
P(M) is called a probability density; it yields an actual probability 
when it is multiplied by the infinitesimal range dM, 

The probability (73) is readily expressed explicitly in terms of the 
probability P’(M) that the total magnetic moment assumes the par- 
ticular discrete value M. Since (71) shows that the possible values of 
M are separated by amounts 29 and since dM > 2no, the range be- 
tween M and M + dM contains dM/(2p0) possible values of M. All 
of these occur with nearly the same probability P’(M) since the prob- 
ability is very slowly varying over the small range dM. Hence the 
probability that the total moment lies in the range between M and 
M + dM is simply obtained by summing P”(M) over all values of M 
lying in this range, 1.6., by multiplying the nearly constant value 
P'(M) by dM/(2p0). This probability is thus properly proportional to 
dM and is given explicitly by 


P(M) dM = P'(M ne =i 


(74) 


f It is worth noting that many differentials used in physics are macroscopic infinitesimals. 
For example} in the study of electricity, one often talks of the charge Q on a body and of 
ἃ. charge increment dQ. This differential description is valid if dQ is understood to be 
much larger than the discrete electronic charge ¢, although it is supposed to be negligibly 
small compared to Q itself. 

1 Since the probability is some smooth function of dM, it should be expressible near any|} 
value of M as a Taylor series in powers of dM when dM is small. ‘Thus it should be of the 
fom 

Probability = ay + a; dM + a,(dM)2 + ... 


where the coefficients ao,a;,... depend on M. Here ay = 0 since the probability must 
‘approach zero as dM is made very small; furthermore, terms involving higher powers of 
dM are negligibly small compared to the leading term which is proportional to dM. Hence 
one is led to the result (73). 
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dP 
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dM 

Fig, 3.11 The probability P’(M) that the total 
magnetic moment of a spin system has a value 
M in a case where the number N of spins is 
large and the magnetic moment up of a spin is 
relatively small, 


dM 


Fig, 2.12 The probability distribution of Fig. 
2.11 is shown expressed in terms of the proba- 
bility density P(M). Here P(M) dM [which 
is equal to the area under the curve in the small 
range between M and M + dM ]Jis the proba- 
bility that the total magnetic moment lies in 
the range between M and M + dM. 
— 


AP CAd y= POAr.dM 
«ει, Daan eo 
ΥᾺ σόν. Leon 


oa Nhe Risloales sty, \oehusee in 
NA οκκὰ oad depends 


on Υλ ona gy di 


eendente on Ais 
ousilalde yy POM) 
ὃς denis Swonction 


\, 


Moservalre 


Fig. 2.13 Subdivision of the domain of a con- 
tinuous variable τὶ into a countable number of 
infinitesimal equal intervals of fixed size διε. 
Each such interval is labeled by an index r 
which can assume the values 1, 2, 3, 4,.... 
The size of a macroscopic infinitesimal range 
du is also shown. 
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In practice, the actual calculation of P’(M) may be laborious if M/uo 
is large since the binomial distribution (14) then requires the evalua- 
tion of large factorials. These difficulties may, however, be circum- 
vented by using the Gaussian approximation of Appendix A.1. 

There are many problems where a variable of interest, call it u, is 
intrinsically continuous. For example, u might denote the angle be- 


+ tween some vector in a plane and a fixed direction; this angle could 


then assume any value in the domain between 0 and 27. In the gen- 
eral case u can assume any value in some domain a; < u < ap. (This 
domain may be infinite in extent, ie., a, > —00, or a2 — ©, or 
both.) Probability statements can be made about such a variable in 
a manner completely analogous to that discussed in the case of M. 


+ Thus one can focus attention on any infinitesimal range between τὶ 


and u + du and ask for the probability that the variable lies in this 


: range. When du is sufficiently small, this probability must again be 


proportional to du so that it can be written in the form P(u) du where 
the quantity P(u) is a probability density independent of the size of du. 
Probability considerations involving a continuous variable u can 
readily be reduced to the simpler situation where the possible values 
of the variables are discrete and thus countable, It is only necessary 
to subdivide the domain of possible values of u into arbitrarily small 
equal intervals of fixed size 6u. Each such interval can then be labeled 
by some index τ. The value of τὶ in this interval can be denoted by u, 
and the probability that τὶ lies in this interval by P, or P(u,). This 
procedure allows us to deal with a countable set of values of the vari- 
able u, each such value corresponding to one of the infinitesimal inter- 
vals r = 1,2,3,.... It also becomes apparent that relations involving 
probabilities of discrete variables remain equally valid for probabilities 
of continuous variables. For example, the simple properties (32) and 
(33) of mean values are also applicable if u is a continuous variable. 
Note that the sums involved in calculating normalization conditions 
or mean values can be expressed as integrals if the variable is continu- 
ous. For example, the normalization condition asserts that the sum 
of the probabilities over all possible values of the variable must equal 
unity; in symbols ; 
Ru) = 1. = Comelaitd (75) 
But, if the variable is continuous, one can first sum over all discrete 
intervals r for which τιν lies in the range between u and u + du; this 
gives the probability P(u) du that the variable lies in this range.t One 


+ Here the range du is understood to be large compared to the arbitrarily small inter- 
val du (so that du > δι), but to be sufficiently small so that P(u;) does not vary apprecia- 
bly within the range du. 
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can then complete the sum (75) by summing (ie., integrating) over all 
such possible ranges du. Thus (75) is equivalent to 


[ἥρω du=1 (76) 


which expresses the normalization condition in terms of the proba- 
bility density P(u). Similarly, the general definition (31) of the mean 
value of a function f(t) of discrete variables is given by 


FMS > Ῥω" (ur). (77) 


In a continuous description one can again sum first over all intervals r 
for which u, lies in the range between u and u + du; this contributes 
to the sum an amount P(t) du f(u). One can then complete the sum 
by integrating over all possible ranges du. Hence (77) is equivalent 
to the relationt e 


Τῶ = fu) Fw) au. (78) 


"5 
Generalization to the case of several variables ἊΣ Afra Corre aradachs o® EN) otfer "> 


The generalization of the Previous re-. 
marks to the case of more than one 
variable is immediate. Suppose, for ex- 
ample, that one is dealing with two con- 
tinuous variables u and v. Then the 
joint probability that the variable wu lies 
in the small range between u and u + du 
and that the variable υ lies in the small 
range between v and » + dv is Propor- 
tional to both du and dv and can be 
written in the form P(u,v) duds where 
(u,v) is a probability density independ- 
ent of the size of du or dv. If desired, 
the situation can again be reduced to one 
of discrete probabilities by subdividing 
the variable u into very small fixed inter- 
vals du, each of which can be labeled by 
some index r, and by subdividing the 
variable © into very small fixed intervals 
5v, each of which can be labeled by some 
other index s. In these terms the situa- 
tion can then be described by specifying 
the probability P,, that the variables have 
values lying in any given cell labeled by 
the pair of indices r and s. 


t Note that the probability density P(u) may become infinite for certain values of μι. 

es 
This does not lead to any difficulties as long as any integral f P(u) du (which gives the 
Probability that the value of εἰ lies in an arbitrary range between c; and ca) remains finite. 


Fig, 2.14 Subdivision of the continuous vari- 
ables u and v into small equal intervals of mag- 
nitude 6u and 6v labeled by 7 and 5, respec- 
tively. The uo plane is thus subdivided into 
small cells, each labeled by the pair of indices 


rands. 
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Summary of Definitions 


statistical ensemble An assembly of a very large number of mutually noninteracting 
systems, each of which satisfies the same conditions as those known to be satisfied by a 
particular system under consideration. 

time-independent ensemble An ensemble in which the number of systems exhibiting 
any particular property is the same at any time. 

event The outcome of an experiment or result of an observation. 

probability The probability P, of occurrence of an event r in a system is defined with 
respect to a statistical ensemble of 4” such systems. If «ἡ, systems in the ensemble 
exhibit the event τσ, then 


(where «γ᾽ -- 00). 
statistical independence ‘Two events are statistically independent if the occurrence of 


‘one event does not depend on the occurrence or nonoccurrence of the other event. 
mean value (or ensemble average) The mean value of εἰ is denoted by ii and defined as 


a=> Pau, 
τ 

where the sum is over all possible values u, of the variable τὶ and where P, denotes the 

probability of occurrence of the particular value u,. 
dispersion (or variance) The dispersion of u is defined as 

[δώξ Ξε δ᾽ Pru, -- a. 
τ 
standard deviation The standard deviation of εἰ is defined as 
u= [{Δ)3}.,5, 

probability density The probability density P(u) is defined by the property that P(u) du 


yields the probability of finding the continuous variable u in the range between u and 
u+ du. 


Important Relations 


Given N statistically independent events, each having probability p of occurring 
(and probability q = 1 — p of not opening): 


Probability that n of these N erent occur (binomial distribution): 
Pn) = Nar ΤΠ pg. @) 
Mean number of occurring events: ἢ = Np. (i) 


Standard deviation of n: An = VNpq. (iii) 
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Suggestions for Supplementary Reading 

W. Weaver, Lady Luck (Anchor Books, Doubleday & Company, Inc., Garden City, N.Y., 
1963). An elementary introduction to probability concepts. 

F. Mosteller, R. E. K. Rourke, and G. B. Thomas, Probability and Statistics (Addison- 
Wesley Publishing Company, Reading, Mass., 1961). 

F. Reif, Fundamentals of Statistical and Thermal Physics, chap. 1 (McGraw-Hill Book 
Company, New York, 1965). ‘The random walk problem discussed in this book is analo- 
gous to the problem of the ideal spin system, but is treated at greater length. 

H. D. Young, Statistical Treatment of Experimental Data (McGraw-Hill Book Company, 
New York, 1962). An elementary account of statistical methods, particularly as they 
are applied to problems of experimental measurement. 

W. Feller, An Introduction to Probability Theory and its Applications, 2d ed. (John Wiley 
and Sons, New York, 1959). This book on probability theory is more advanced than the 
preceding ones, but discusses many concrete examples. 


Problems 


2.1 An elementary dice problem 
What is the probability of throwing a total of 6 points or less with 3 dice? 


2.2 Random numbers 
A number is chosen at random between 0 and 1. What is the proba- 
bility that exactly 5 of its first 10 decimal places consist of digits less than 5? 


2.3 Tossing of dice 
Assume that each face of a die is equally likely to land uppermost. Con- 
sider a game which involves the tossing of 5 such dice. Find the probability 
that the number “6” appears uppermost 
(a) in exactly one die, 
(b) in at least one die, 
(c) in exactly two dice. 


3.4 Probability of survival 
In the macabre game of Russian roulette (not recommended by the 

author) one inserts a single cartridge into one of the 6 chambers of the cylinder 
of a revolver, leaving the other 5 chambers of the cylinder empty. One then 
spins the cylinder, aims at one’s head, and pulls the trigger. What is the prob- 
ability of still being alive after playing the game 

(a) Once? 

(b) Twice? 

(c) N times? 

(d) What is the probability of being shot when playing the game the Nth time? 
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2.5 The random walk problem 
‘A man starts out from a lamppost in the middle of a street, taking steps 

of equal length 1. The probability is p that any one of his steps is to the right, 
and q = 1—p that it is to the left. ‘The man is so drunk that his behavior at 
any step shows no traces of memory of what he did at preceding steps. His 
steps are thus statistically independent. Suppose that the man has taken N steps. 

(a) What is the probability P(n) that n of these steps are to the right and the 
remaining π' = (N — n) steps are to the left? 

(b) What is the probability P(m) that the displacement of the man from the 
lamppost is equal to ml, where m = n — n’ is an integer? 


2.6 Probability of returning to starting point 
In the preceding problem, suppose that p = 4 50 that each step is equally 
likely to be to the right or to the left. What is the probability that the man will 
again be at the lamppost after taking N steps 
(a) If Nis even? 
(b) If Nis odd? 


2.7 One-dimensional diffusion of an atom 

Consider a thin copper wire stretched out along the x axis. A few of 
the copper atoms, located near x = 0, are made radioactive by bombardment 
with fast particles. When the temperature of the wire is raised, the atoms be- 
come more mobile. Each atom can then jump to an adjacent lattice site, either 
to the site on its right (i.e., in the +x direction) or the site on its left (i.e., in the 
—x direction). The possible lattice sites are separated by a distance I, Assume 
that one has to wait a time 7 before a given atom jumps to an adjacent lattice 
site. This time 7 is a rapidly increasing function of the absolute temperature of 
the wire. The process whereby the atoms move about by virtue of successive 
jumps to adjacent sites is known as diffusion. 

Suppose that the wire is rapidly raised to some high temperature at some time 
t = 0 and is thereafter maintained at this temperature. 

(a) Let P(x) dx denote the probability that a radioactive atom is found at a dis- 
tance between x and x + dr after a time t. [We assume that 679» τ for all times t 
of physical interest, since τ is quite short when the temperature of the wire is 
high.] Make a rough sketch showing the pehavior of P(x) as a function of x in 
the following three cases: - : 

(1) shortly after the time t = 0; 

(2) after a moderately long time t has elapsed; 

(3) after an extremely long time t has elapsed. 

(b) What is the mean displacement x from the origin of a radioactive atom 
after a time t? 

(c) Find an explicit expression for the standard deviation Ax of the displace- 
ment of a radioactive atom after a time ft. 
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2.8 Computation of the dispersion 
Use the general properties of mean values to show that the dispersion of 
τ can be calculated by the general relation 


(Au)? = (u — τὴ = 2 -- a, ῶ 


The last expression on the right provides a simple method for computing the 
dispersion. 
Show also that (i) implies the general inequality 


w > τῷ, (ii) 


2.9 Mean values for a single spin 
The magnetic moment of a spin } is such that its component μι in the up 
direction has probability p of being equal to 1o, and probability g = 1 — p of 
being equal to — μο. 
(a) Calculate ji and p2. 
(Ὁ) Use the expression (i) of Prob, 2.8 to calculate (Δμ)2. Show that your 
result agrees with that given in Eq. (57) of the text. 


2.10 The inequality τι > a? 
Suppose that the variable w can assume the possible values tu, with 
respective probabilities P,. 
(a) By using the definitions of ὦ and u®, and remembering the normalization 


requirement ΣΡ = 1, show that 
@-—@=1 SD PP(u,— u,) @ 
275 
where each sum extends over all possible values of the variable u. 
(Ὁ) Since no term in the sum (i) can ever be negative, show that 
ὩΣῈ τ (ii) 


where the equals sign applies only in the case where only one value of εἰ occurs 
with nonzero probability. The result (ii) agrees with that derived in Prob. 2.8. 


2.11 The inequality (u")? < ἀπττμππι 
The result (i) of the preceding problem suggests an immediate generali- 
zation. Thus consider the expression 


DD PrPsuymuy tty — te)? @) 
ἘΞ 
where m is any integer. When πὶ is even, this expression can never be negative; 


when m is odd, it can never be negative if the possible values of τι are all non- 
negative (or all nonpositive). 
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(a) By performing the indicated multiplications in (i), show the following 
results: 

(ary? < writ (i) 
where n=m + 1. If nis odd, this inequality is always valid; if n is even, it is 
valid if the possible values of τὶ are all nonnegative (or all nonpositive). The 
equals sign in (ii) applies only in the case where a single possible value of τὶ 
occurs with nonzero probability. 

(b) Show that (ii) implies, as a special case, the inequality 
fi\s2 εἴ 
()::. (i) 
valid whenever the possible values of u are all positive (or all negative). The 
equals sign applies in the special case where only one value of u occurs with 
nonzero probability. 


2.12 Optimum investment method 
‘The following practical situation illustrates that different ways of aver- 

aging the same quantity can lead to significantly different results. Suppose that 
one desires to invest money by buying at the beginning of every month a certain 
number of shares of stock in a company. The cost ον per share depends, of 
course, on the particular month r and varies from month to month in a fairly 
unpredictable fashion. Two alternative methods of regular investment suggest 
themselves: In method A, one buys the same number s of shares of stock every 
month; in method B, one buys stock for the same amount of money m every 
month. After N months, one will then have acquired a total number S of shares 
of stock and will have paid for them a total amount of money M. The best in- 
vestinent method is clearly that which yields the largest number of shares for 
the lowest amount of money, i.e., which yields the largest ratio S/M. 

(a) Obtain an expression for the ratio $/M in the case of method A. 

(b) Obtain an expression for the ratio S/M in the case of method B. 

(c) Show that method B is the better investment method, no matter how the 
cost of shares fluctuates from month to month. [Suggestion: Exploit the ine- 
quality (iii) of the preceding problem.] 


2.13 System of nuclei with spin I 

Consider a nucleus having spin 1 (i.e., spin angular momentum h). Its 
component μ of magnetic moment aldng a given direction can then have three 
possible values, namely +o, 0, or —o. Suppose that the nucleus is not spheri- 
cally symmetrical, but is ellipsoidal in shape. As ἃ result it tends to be prefer- 
entially oriented so that its major axis is parallel to a given direction in the 
crystalline solid in which the nucleus is located. There is thus a probability p 
that μ = flo, and a probability p that μ = —po; the probability that μ = Ois then 
equal to 1 — 2p. 
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(a) Calculate ji and μξ. 

(b) Calculate (Qp)?. 

(c) Suppose that the solid under consideration contains N such nuclei which 
interact with each other to a negligible extent. Let M denote the total com- 
ponent of magnetic moment, along the specified direction, of all these nuclei. 
Calculate M and its standard deviation AM in terms of N, p, and pio. 


2.14 Direct calculation of πὶ and (An)? 
Consider an ideal system of N identical spins }. The number n of mag- 
netic moments which point in the up direction can then be written in the form 


n=) +t2+--- + Uy (i) 


when u; = 1 if the ith magnetic moment points up and u; = 0 if its points down, 
Use the expression (i) and the fact that the spins are statistically independent to 
establish the following results. 

(a) Show that i = Nu. 

(b) Show that (An)? = N(Au)?. 

(c) Suppose that a magnetic moment has probability p of pointing up and 
probability q = 1 — p of pointing down. What are ἃ and (Au)?? 

(d) Calculate πὶ and (An)? and show that your results agree with the relations 
(66) and (67) found in the text by a less direct method. 


2.15 Density fluctuations in a gas 

Consider an ideal gas of N molecules which is in equilibrium within a 
container of volume Vo. Denote by n the number of molecules located within 
any subvolume V of this container. The probability p that a given molecule is 
located within this subvolume V is then given by p = V/Vo. 

(a) What is the mean number fi of molecules located within V?_ Express your 
answer in terms of N, Vo, and V. 

(b) Find the standard deviation An in the number of molecules located within 
the subvolume V. Hence calculate An/n, expressing your answer in terms of 
N, Vo, and V. 

(c) What does the answer to part (b) become when V< Vo? 

(d) What value should the standard deviation An assume when V — Vo? 
Does the answer to part (b) agree with this expectation? 


2.16 Shot effect : 

Electrons of charge δ are emitted at random from the hot filament of 
a vacuum tube. To good approximation the emission of any one electron does 
not affect the likelihood of emission of any other electron. Consider any very 
smal] time interval At. Then there is some probability p that an electron will be 
emitted from the filament during this time interval (and probability q = 1 — p 
that an electron will not be emitted). Since At is very small, the probability p 
of emission during this time interval is very small (i-e., p<<_1) and the probability 
that more than one electron will be emitted during the time At is negligible. 
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Consider any time t which is much larger than At. During this time there 
are then N = ¢/At possible time intervals At during which an electron can be 
emitted. The total charge emitted in time { can then be written as 


ΟΞ φι ἐ 4: ὁ qat --- τὴν 


where q; denotes the charge which is emitted during the ith interval At; thus 
qi = ¢ if an electron is emitted and q; = 0 if it is not. 

(a) What is the mean charge Ὁ emitted from the filament during the time t? 

(b) What is the dispersion (AQ)? of the charge Q emitted from the filament 
during the time t? Make use: of the fact that p< 1 to simplify the answer to 
this question. 

(c) The current I emitted during the time ὁ is given by Q/t. Relate thus the 
dispersion (AT)? of the current to the mean current I, showing that 


(Δηξ = ἜΣ 


(4) The fact that the current measured during any interval t exhibits fluctua- 
tions (which become more pronounced the shorter the time interval, i.e., the 
fewer the total number of individual electrons involved in the emission process) 
is known as the shot effect. Calculate the standard deviation AI of the current 
if the mean current J = 1 microampere and the measuring time is 1 second. 


2.17 Calculation of a mean square value 

A battery of total emf V is connected to a resistor R. As a result, an 
amount of power P = V2/R is dissipated in this resistor. The battery itself 
consists of N individual cells connected in series so that V is just equal to the 
sum of the emf's of all these cells. The battery is old, however, so that not all 
the cells are in perfect condition. Thus there is only a probability p that the 
emf of any individual cell has its normal value Ὁ; and a probability (1 — p) that 
the emf of any individual cell is zero because the cell has become internally 
shorted. The individual cells are statistically independent of each other. Under 
these conditions, calculate the mean power P dissipated in the resistor. Express 
the result in terms of N, v, p, and R. 


2.18 Estimate of error of measurement 

A man attempts to lay off a distance of 50 meters by placing a meter 
stick end to end 50 times in succession. This procedure is necessarily accom- 
panied by some error. Thus the mhn cannot guarantee a distance of precisely 
one meter between the two chalk marks which he makes each time he places 
the meter stick on the ground. He knows, however, that the distance between 
the two marks is equally likely to lie anywhere between 99.8 and 100.2 cm, and 
that it certainly does not lie outside these limits. After repeating the operation 
50 times, the man will indeed have laid off a mean distance of 50 meters. To 
estimate his total error, calculate the standard deviation of his measured distance. 
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2.19 Diffusion of a molecule in a gas 
A molecule in a gas is free to move in three dimensions. Let s denote 
its displacement between successive collisions with other molecules, Displace- 
ments of the molecule between successive collisions are, to fair approximation, 
statistically independent. Furthermore, since there is no preferred direction in 
space, a molecule is as likely to move in a given direction as in the opposite direc- 
tion. Thus its mean displacement 5 = 0 (i.e., each component of this displace- 
ment vanishes on the average so that 5, = ὃν = δ, = 0). 
The total displacement R of the molecule after N successive displacements 
can then be written as 


R= 51+ 5) +534 τ. + sy 


where s; denotes the ith displacement of the molecule, Use reasoning similar 
to that of Sec. 2.5 to answer the following questions: 
(a) What is the mean displacement R of the molecule after N displacements? 
(b) What is the standard deviation AR = (R — ἘΠ of this displacement after 
N collisions? In particular, what is AR if the magnitude of each displacement s 
has the same length 1? 


2.20 Displacement distribution of random oscillators 
The displacement x of a classical simple harmonic oscillator as a func- 
tion of the time t is given by 


x = A cos (wt + φ) 


where ὦ is the angular frequency of the oscillator, A is its amplitude of oscilla- 
tion, and is an arbitrary constant which can have any value in the range 
Ὁ << 2m. Suppose that one contemplates an ensemble of such oscillators 
all of which have the same frequency w and amplitude A, but which have random 
phase relationships so that the probability that @ lies in the range between p 
and @ + ἀφ is given simply by dp/(2z). Find the probability P(x) dx that the 
displacement of an oscillator, at any given time t, is found to lie in the range 
between x and x + dx. 
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Statistical Description of Systems of Particles 


The preceding review of basic probability concepts has prepared us 
to turn the qualitative considerations of the first chapter into a syste- 
matic quantitative theory of systems consisting of very many particles. 
Our aim will be to combine statistical considerations with our knowl- 
edge of the laws of mechanics applicable to the particles constituting 
a macroscopic system. The resulting theory, therefore, is called sta- 
tistical mechanics. The reasoning that leads to this theory is very 
simple and uses only the most primitive ideas of mechanics and prob- 
ability. The beauty of the subject lies precisely in the fact that argu- 
ments of great simplicity and apparent innocence are capable of 
yielding results of impressive generality and predictive power. 

Indeed, the arguments used to discuss a macroscopic system are 
completely analogous to those used to discuss the familiar experiment 
of tossing a set of coins. The essential ingredients for an analysis of 
this experiment are the following: 


(i) Specification of the state of the system 

We must have available a method for specifying any of the possible 
outcomes of an experiment involving the system. For example, the 
state of the set of coins after any toss can be described by specifying 
which particular face of each coin is uppermost. 


(ii) Statistical ensemble 

We have far too little information about the precise manner in which 
the coins are tossed to be able to use the laws of mechanics to make a 
unique prediction about the outcome of any particular experiment. 
Hence we resort to a statistical description of the situation. Instead 
of considering the particular set of coins of interest, we thus focus atten- 
tion on an ensemble consisting of.a very large number of similar sets 
of coins subjected to the same experiment. We can then ask for the 
probability of occurrence of any experimental outcome. This proba- 
bility can be measured by observing the ensemble and determining 
the fraction of systems exhibiting the particular outcome. Our theo- 
retical aim is the prediction of any such probability. 


Γ 
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(iii) Statistical postulates ° 

To make theoretical progress, it is necessary to introduce some postu- 
lates. In the case of ΤΣ coins of uniform density, there is nothing 
‘intrinsic in the laws of mechanics which implies that one face of a coin 
should land uppermost more frequently than the other. Hence we are 
led to introduce the postulate that “a priori” (i.e., based on our prior 
notions as yet unverified by actual observations) a coin has equal prob- 
ability of landing on each of its two faces. This postulate is eminently 
reasonable and certainly does not contradict any of the laws of me- 
chanics. The actual validity of the postulate can, however, only be 
decided by using it to make theoretical predictions and by checking 
that these predictions are confirmed by experimental observations. To 
the extent that such predictions are consistently verified, the validity 
of this postulate can be accepted with confidence. 


(iv) Probability calculations 

Once the basic postulate has been adopted, we can calculate the prob- 
ability of occurrence of any particular outcome involving the set of 
coins under consideration. We can also compute various mean values 
of interest. Thus we can answer all questions that can meaningfully 
be asked in a statistical theory. 


In studying systems consisting of a large number of particles, our 
considerations will be very similar to those used in formulating the 
preceding problem of a set of coins. The next four sections will make 
this analogy explicit. 


3.1 Specification of the State of a System 

The study of atomic particles has shown that any system of such par- 
ticles is described by the laws of quantum mechanics, These laws, 
whose validity. is supported by an overwhelming amount of experi- 
mental evidence, will thus form the conceptual basis for our entire 
discussion. 

In a quantum-mechanical description the-most precise possible 
measurement on a system always shows this system to be in some one 
of a set of discrete quantum states characteristic of the system. The 
microscopic state of a system can thus be described completely by 
specifying the particular quantum state in which the system is found. 


Fig, 3.1 Highly schematic diagram illustrat- 
ing the first few energy levels of an arbitrary 
system. Each line denotes a possible quantum 
state of the system, while the vertical position 
of the line indicates the energy E of the system 

in this state. Note that there are many states 
which have the same energy. 


Table 3.1 Quantum states of a single 


spin 
} having a magnetic moment jo and located 
in a magnetic field B. Each state of the sys- 
tem can be labeled by an index r, or alterna- 


_ tively, by the quantum number σ. The 
~ magnetic moment (along the “up” direction 

specified by the field B) is denoted by M; the 
_ total energy of the system is denoted by E. 


Fig. 3.2 Diagram showing the two energy 
levels of a spin ᾧ having a magnetic moment 
Ho and located in a magnetic field B. The 
state where the magnetic moment points “up,” 
so that its direction is parallel to B, is denoted 
by o = +1 (or simply by +); that where it 
points “down” is denoted by = —1 (or sim- 
* ply by —). 
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Each quantum state of an isolated system is associated with a defi- 
nite value of its energy and is called an energy level.t_ There may be 
several quantum states corresponding to the same energy of the sys- 
tem. (These quantum states are then said to be degenerate.) Every 
system has a lowest possible energy. There is usually only one possible 
quantum state of the system corresponding to this lowest energy; this 
state is said to be the ground state of the system. In addition there 
are, of course, many (indeed, ordinarily infinitely many) possible states 
with higher energies; these are called the excited states of the system. 

: The preceding comments are completely general and applicable to 
any system, no matter how complex. They are best illustrated by 
some simple examples of great practical interest. 


(i) Single spin 


Consider a single particle, assumed to 
be fixed in position, which has spin and 
8 magnetic moment of magnitude ji. 
As already discussed in Sec. 1.3, this 
moment will be found to point either 
“up” or down" (i.e., parallel or antipar- 
allel) with respect to any specified direc- 
tion. The system consisting of this single 
spin thus has only two quantum states 
which we shall label by 8 quantum num- 
ber σ. We can then denote the state 
where the magnetic moment of the par- 
ticle points up by σ = +1, and the state 
where it points down by o = —1. 

- ΜΕ the particle is in the presence of a 
magnetic field B, this field specifies the 


direction of physical interest in the prob- 
lem. The energy E of the system then is 
lower when the magnetic moment is 
aligned parallel to the field rather than 
antiparallel to it. The situation is analo- 
gous to that of a bar magnet located in 
an external magnetic field. Thus, when 
the magnetic moment points up (ie., 
parallel to the field B), its magnetic en- 
ergy is simply —}oB. Conversely, when 
the moment points down (i.e., antiparal- 
lel to the field B), its magnetic energy is 
simply μοΒ. The two quantum states (or 
energy levels) of the system then corre- 
spond to different energies. 


+The hydrogen atom is likely to be a familiar example of a system described in terms 
of discrete energy levels. Transitions of the atom between states of different energy give 
rise to the sharp spectral lines emitted by the atom. A description in terms of energy levels 
is, of course, equally applicable to any atom, molecule, or system consisting of many atoms. 
{In some cases there may be a relatively small number of quantum states of the same 
‘energy equal to the lowest possible energy of the system. The ground state of the system 


is then said to be degenerate. 
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Gi) Ideal system of N spins 


Consider a system consisting of N parti- 
cles, assumed to be fixed in position, 
where each particle has spin } and a 
magnetic moment jo. The system is lo- 
cated in an applied magnetic field B. 
The interaction between the particles is 
assumed to be almost negligible.} 

The magnetic moment of each particle 
can point either up or down with respect 
to the field B. The orientation of the ith 
moment can thus be specified by the 
value of its quantum number σι so that 
σι = +1 when this moment points up and 
σι = —1 whenit points down. A particu- 
lar state of the whole system can then be 
specified by stating the orientation of 


(iii) Particle in a one-dimensional box 


Consider a single particle, of mass m, 
free to move in one dimension. The par- 
ticle is supposed to be confined within a 
box of length L, so that the particle's 
Position coordinate x must lie in the 
range O<x<L. Within this box the 
particle is subject to no forces. 

In 8 quantum-mechanical description, 
the particle has wave properties associ- 
ated with it. The particle confined within 
the box and bouncing back and forth be- 
tween its walls is thus represented by a 
wave function yin the form of a standing 
wave whose amplitude must vanish at 
the boundaries of the box (since y itself 
must vanish outside the box).¢ The 
wave function thus must be of the form 
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each of the N moments, i.e., by specify- 
ing the values assumed by the set of 
quantum numbers {o3,02,..., 0}. Thus 
one can enumerate, and label by some 
index στ, all the possible states of the 
whole system. This is done in Table 3.2 
for the special case where N= 4. The 
total magnetic moment of the system is 
simply equal to the sum of the magnetic 
moments of the individual spins. Since 
the interaction between these spins is 
almost negligible, the total energy E of 
the system is also simply equal to the 
sum of the energies of the individual 
‘spins. 


W(x) = Asin Kx ΠῚ 

(where A and K are constants) and must 
satisfy the boundary conditions 

WO)=0 τά Y{L)=0. (2) 


‘The expression (1) obviously satisfies the 


condition ψ(Ο) = 0. In order that it also 
satisfy the condition (L) = 0, the con- 
stant K must be such that 


KL=an 


K=2 
or Zn, 8 
where n can assume any of the integral 
values§ 


n= 1,2,3,4,.... (4) 


This assumption implies that one can virtually neglect the magnetic field at the posi- 
tion of any one particle caused by the magnetic moments of the other particles. 

4 The physical interpretation of the wave function is that |y(x)|? dx represents the prob- 
ability that the particle is found in the range between x and x + dz. 

§ The value n = 0 is irrelevant since it leads to Ψ = 0, i.e., to no wave function (or no 
particle) existing within the box. Negative integral values of n lead to no distinct new 
wave functions since a change of sign of n, and thus of K, leads merely to a change of sign 
of ¥ in (1) and leaves the probability |y|? dr unchanged. Hence the positive integral values 
of n yield all the distinct possible wave functions of the form (1). Physically this means 
that only the magnitude AK of the particle’s momentum is relevant, since this momentum 
ἐς equally likely to be positive or negative as a result of successive reflections of the particle 
by the walls. ὰ 


Table 3.2 Quantum states of an ideal system 
of 4 spins ὁ, each having a magnetic moment j1p 
and located in a magnetic field B. Each 
quantum state of the whole system is labeled 
by the index r, or equivalently, by the set of 
4 numbers {03, 02, 03, 64}. For the sake 
of brevity, the symbol + indicates σ = +1 
and the symbol — indicates σ = —1. The 
total magnetic moment (along the “up” direc- 
tion specified by B) is denoted by M; the total 
energy of the system is denoted by E. 


Fig. 3.3 A box in the shape of a rectangular 
parallelepiped with edge lengths L,, Ly, and L.. 
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The constant K in (1) is the wave num- 
ber associated with the particle; it is re- 
lated to the wavelength A (the so-called 


de Broglie associated with the 
particle) by the relation 
γε 

κε ϑ-, 6) 


Hence (3) is equivalent to 


and represents merely the familiar con- 
dition that standing waves are obtained 
when the length of the box is equal to 
some integral multiple of half-wave- 
lengths. 

The momentum p of the particle is re- 
lated to K (or A) by the famous de Broglie 
relation 

h 
patK=> (6) 
where ἢ = h/2n and h is Planck's con- 
stant. The energy E of the particle is 
simply its kinetic energy, since there is 
no potential energy due to external 
forces. Hence E can be expressed in 
terms of the velocity » or momentum 
p = mo of the particle as 


The possible values (3) of K then yield 
the corresponding energies 


(iv) Particle in a three-dimensional box 


The generalization of the preceding prob- 
lem to the case of a particle free to move 
in three dimensions is straightforward. 
Suppose that the particle is confined 
within a box in the shape of a rectangular 
parallelepiped of edge lengths L,, L,, and 
1... The position coordinates x, y,z@t the 


Σ gi? ni? 
) tamu © 

Equivalently, we could have discussed 
the whole problem from a more mathe- 
matical point of view by starting from the 
fundamental Schrédinger equation for 
the wave function y. For a free particle 
in one dimension this equation is 


The functional form (1) satisfies this 
equation provided that the energy E is 


related to K by (7), The condition (2) « 


that the wave function must vanish at 
the boundaries of the box leads again to 
(3) and hence to the expression (8) for 
the energy. 

The possible quantum states of the 
particle in the box thus can be specified 
by the possible values (4) of the quantum 
number n. The corresponding discrete 
energies of these states (i.e., the corre- 
sponding energy levels of the particle) 
are then given by (8). 

The relation (8) shows that the separa- 
tion in energy between successive quan- 
tum states of the particle is very small if 
the length L of the box is of macroscopic 
size. The lowest possible energy of the 
particle, i.e., its ground-state energy, cor- 
responds to the state n=1. Note 
that this ground-state energy does not 
vanish.¢ 


particle then can be assumed to lie in the 
respective ranges 
O<x<lL, O<y<k, O<2<L. 


The particle has mass m and is subject 
to no forces within the box. 


4 This conclusion is consistent with the Heisenberg uncertainty principle (Ax Ap > fi) 
according to which a particle confined within a linear dimension of length L (so that 
Ax ~ L) must have associated with it a minimum momentum p of the order of p ~fi/L. 
Hence the lowest possible energy of the particle in the box is a kinetic energy of the order 


of p?/2m = h2/2mL2. 
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The wave function of the particle now 
represents a standing wave in three di- 
mensions. Thus it is of the form 


¥ = A(sin K.2)(sin Kyy)(sin Kz) (9) 


where the constants K,,K,,K; can be 
regarded as the three components of 8 
vector K, the wave vector of the particle. 
According to the de Broglie relation the 
momentum of the particle is then given 
by 

p=aK (0) 
so that the relationship between the mag- 
nitude of p and the magnitude of K (or 


the wavelength A) is the same as in (6). 
The energy of the particle is then given by 


pe PRR 


2m 2m 


=P K24K2+K2). (ἢ) 
2m 

Equivalently, it can be immediately veri- 

fied that Ψ in (9) is indeed a solution of 

the time-independent Schrédinger equa- 

tion for a free particle in three dimen- 


sions, 
ay ay 
os ae ἢ a τ)- τ τ: 


provided that Ε is related to K by (11). 


(v) Ideal gas of N particles in a box 


Consider a system consisting of N par- 
ticles, each of mass m, confined within 
the box of the preceding example. The 
interaction between the particles is as- 
sumed to be almost negligible so that the 
particles constitute an ideal gas. The 
total energy E of the gas is then just 
equal to the sum of the energies of the 
individual particles, i.e., 
E=a+@+4+--- 


+e (16) 


where εἰ denotes the energy of the ith 
particle. The state of each such particle 
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The fact that Y must vanish at the 
boundaries of the box imposes the con- 
ditions that 


¥=0 atthe planes 
ἀπο, y=0, z=0, 
x=L, y=l, z=L, 
The expression (9) vanishes properly 
when 0, y=0, orz=0, To make 
it vanish for x= Lz, y = Ly, or Ζ τε ἴων 


the constants K., Ky, Κ, must satisfy the 
respective conditions 


} a2) 


(13) 


where each of the numbers n,, ny, and n, 
can assume any of the positive integral 
values 


Na My; = 1,2,3,4,.... (4) 


Any particular quantum state of the par- 
ticle can then be designated by the values 
assumed by the set of quantum numbers 
{nz, ny, nz}. Its corresponding energy is, 
by (11) and (13), equal to 


(15) 


can be specified, as in the preceding ex- 
ample, by the values of its 3 quantum 
numbers niz, niy, nis; its energy εἰ is then 
given by an expression analogous to (15). 
Each possible quantum state of the entire 
gas can thus be specified by the values 
assumed by the 3N quantum numbers 


{Miz May Miss Nee Τῶν, Nass.» 
το ἢ TINE Ty, ΤΙΝ}. 


Its corresponding energy is given by (16), 
where each term in the sum has the form 
(15). 
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The preceding examples are typical of the quantum-mechanical 
description and serve to illustrate the general comments made at the 
beginning of this section. We can summarize our remarks as follows: 
Each possible quantum state of a system can be specified by some set 
of f quantum numbers, This number f, called the number of degrees 
of freedom of the system, is equal to the number of independent co- 
ordinates (including spin coordinates) needed to describe the system.t 
Any one quantum state of the system can be specified by the particular 
values assumed by all of its quantum numbers. For simplicity, each 
such state can be labeled by some index τ so that the possible quantum 
states can be listed and enumerated in some convenient order 
r=1,2,3,4,.... Our question about the most detailed quantum- 
mechanical description of a system is then answered by the following 
statement: 


The microscopic state of a system can be described by 
specifying the particular quantum state r in which the 
system is found. 


A completely precise description of an isolated system of particles 
would have to take into account ail the interactions between its par- 
ticles and would determine the rigorously exact quantum states of the 
system. If the system is in any one of these exact states, it would then 
remain in this state forever. But no system is ever so completely iso- 
lated in practice that it does not interact at all with its environment; 
furthermore, it would be neither possible, nor indeed useful, to strive 
for precision so great that all interactions between particles are rigor- 


. ously taken into account. Thus, the quantum states actually used to 


describe a system are in practice always its approximate quantum 
states, determined by taking into account all the important dynamical 
properties of its particles while neglecting small residual interactions. 


* A system initially known to be in one of its approximate quantum states 


does not remain in this state forever. In the course of time it will 
instead, under the influence of the small residual interactions, make 
transitions to its other quantum states (except to those to which it can- 
not pass without violating known restrictions imposed by the laws of 
mechanics). 


{For instance, in the example of N particles without spin, the number of degrees of 
freedom f = 3N. 


Sec. 3.1 107 


The hydrogen atom provides a familiar illustration of the preceding 
comments. The quantum states ordinarily used to describe this atom 
. are those determined by taking into account only the Coulomb attrac- 


tion between the nucleus and electron. The residual interaction of - 


the atom with the surrounding electromagnetic field then causes tran- 
sitions between these states. The result is the emission or absorption 
of electromagnetic radiation, giving rise to the observed spectral lines. 

Illustrations of greater pertinence to us are provided by an isolated 
ideal spin system or an isolated ideal gas. If the particles in such a 
system would not interact with each other at all, the quantum states 
calculated in examples (ii) or (v) of this section would be exact, and 
hence no transitions would occur. But this situation does not corre- 
spond to reality. Indeed, we have been careful to stress that, even 
when a spin system or a gas is ideal, the interaction between its con- 
stituent particles is to be considered as almost negligible rather than 


as completely negligible. Thus small interactions exist ina spin system - 


because each magnetic moment produces a small magnetic field at the 
positions of neighboring moments. Similarly, small interactions exist 
in the gas because forces exerted by one particle on another particle 
come into play whenever these particles approach each other suffi- 


ciently closely (and thus “collide” with each other). If these interac- - 


tions are taken into account, the quantum states calculated in the exam- 
ples (ii) and (v) become approximate quantum states. The effect of the 
interactions then is to cause occasional transitions between these states 
(the occurrence of such transitions being less frequent the smaller the 
magnitude of the interactions). Consider, for instance, the system 
consisting of 4 spins whose quantum states are listed in Table 3.2. Sup- 
pose that this system is originally known to be in the state {+ — + +}. 
As a result of the small interactions between the spins, there is a non- 
vanishing probability that the system will be found at some later time 
in any other state, such as {+ + —+}, to which it can pass without 
violating the condition of conservation of energy. 

Our discussion of the specification of the state of a system has been 
given in terms of quantum ideas since the atoms and molecules con- 
stituting any system are known to be described by the laws of quantum 
mechanics. Under appropriate conditions it may sometimes be a con- 
venient approximation to describe a system in terms of classical 
mechanics. The utility and validity of such approximations will be 
considered in Chap. 6. 
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3.2 Statistical Ensemble 

A precise knowledge of the particular microscopic state in which a 
system of particles is found ft any one time| would, in principle, allow 
us to use the laws of mechanics to calculate in the greatest possible 
detail all of the system’s properties at any arbitrary time. Ordinarily, 
however, we do not have available such precise microscopic knowledge 
about a macroscopic system, nor are we interested in such an exces- 
sively detailed description. Hence we are led to discuss the system 
in terms of probability concepts. Instead of considering the single” 
macroscopic system of interest, we then focus attention on an ensem- 
ble consisting of a very large number of systems of this kind, all satis- 
fying the same conditions as those known to be satisfied by the 
particular system under consideration. With respect to this ensemble, 
we can then make various probability statements about the system. 

A complete macroscopic description of a system of many particles 
defines the so-called macroscopic state or macrostate of the system. 
Since such a description is based entirely on the specification of quan- 
tities which can be readily ascertained by macroscopic measurements 
alone, it provides only very limited information about the particles in 
the system. This information is typically the following: 

(Condidar σέο G Ὁ 

(i) Information about the external parameters of the system 

There are certain parameters of the system which are macroscopically 
measurable and which affect the motion of the particles in the system. 
These parameters are called external parameters of the system. For 
example, the system may be known to be located in a given external 
magnetic field B or a given external electric field €. Since the presence 
of such fields affects the motion of the particles in the system, B or € 
are external parameters of the system. Similarly, suppose that a gas 
is known to be contained within a box of dimensions L,, Ly, and L.. 
Then each molecule of the gas must move so as to remain con- 
fined within this box. The dimensions L,, Ly, Lz are thus external 
parameters of the gas. 

Since the external parameters affect the equations of motion of the 
particles in the system, they must also affect the energy levels of these 
particles. Thus the énergy of each quantum state of a system is ordi- 
narily a function of the external parameters of the system. For exam- 


+ ple, in the case of a spin system, Table 3.1 shows explicitly that the 


energies of the quantum states depend on the value of the external 


+ magnetic field B. Similarly, in the case of a particle in a box, the 


expression (15) shows explicitly that any quantum state specified by the 
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quantum numbers {nz,ny,n.} has an energy which depends on the 
dimensions L,, Ly, Lz of the box. 

A knowledge of all the external parameters of a system thus serves 
to determine the actual energies of its quantum states. 


(ii) Information about the initial preparation of the system 

In view of the conservation laws of mechanics, the initial preparation 
of a system implies certain general restrictions on the subsequent mo- 
tion of the particles in the system. For example, suppose that we are 
dealing with a system which is isolated so that it does not interact with 
any other system. Then the laws of mechanics require that the total 
energy of this system (i.e., the total kinetic and potential energy of all 
the particles in the system) remain constant. The system, when origi- 
nally prepared for observation, must have some total energy which 
may be determined to within some finite precision, i.e., this energy 
may be known to lie within some small range between E and E + δὲ. 
Then the conservation of energy implies that the total energy of the 
system must always be between E and E + δὲ. As a consequence 
of this restriction, the system can be found only in those of its quantum 
states which have an energy lying in this range.t 


We shall call the accessible states of a system those of its quantum 
states in which the system can be found without violating any condi- 
tions imposed by the information available about the system. A sta- 
tistical ensemble selected in accordance with the information available 
about the system must, therefore, comprise systems all of which are 
in their accessible states. As pointed out previously, the specification 
of the macrostate of a system consisting of very many particles pro- 
vides only very limited information about the system. If the system 
is known to be in a given macrostate, the number of quantum states 
accessible to the system is thus ordinarily very large (since the number 


4 In some cases one might also want to take into account other restrictions, such as those 
imposed by the conservation of total momentum. Although this could be done, it is ordi- 
narily not of interest for the following reason: In the case of most laboratory experiments, 
it can be imagined that the system under investigation is enclosed in some container fas- 
tened to the floor of the laboratory and thus to the large mass of the earth. Any collision 
of the particles in the system with the container results then in a nearly negligible velocity 
change of the earth, although the latter can absorb any amount of momentum from the 
system while acquiring negligible energy. (The situation is similar to that of a ball being 
bounced off the earth.) Under these circumstances, there is no restriction on the possible 
momentum of the system, although its energy remains constant. Thus the system can be 
considered isolated with respect to energy transfer, while it is not isolated with respect to 
momentum transfer, 
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Table 3.8 Systematic enumeration of all th 
states, labeled by some index r, which are ac- 
cessible to the system A* when its total energy 
in a magnetic field B is equal to —3y0B. The 
system A* consists of a subsystem A with three 
spins 4, each having magnetic moment Ho, 
and a subsystem A’ with two spins ὁ, each 
having magnetic moment 2pio. 
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of particles in the system is very large). For instance, in the case of 
an isolated system merely known to have an energy between E and 
E + δῈ, all quantum states having energies in this range are accessible 
to the system. 3 

It is conceptually simplest to discuss the case of a system which is 
isolated in the sense that it does not interact with any other systems 
so as to exchange energy with them.} Suppose that the macrostate 
of such an isolated system is specified by stating the values of its exter- 
nal parameters and the particular small range in which its energy is 
known to lie. This information determines then, respectively, the 
energies of the various quantum states of the system and the subset of 
those quantum states which are actually accessible to the system. 

The essential content of the preceding remarks can be illustrated 


most simply by some systems with very few particles. 


Example (i) 


Consider a system of four spins 4 (each 
having 8 magnetic moment pio) located 
in an applied magnetic field B. The 
possible quantum states and associated 
energies of this system are listed in Table 
3.2. Suppose that this system is isolated 
and known to have a total energy which 


Example (ii) 


Consider a system A* which consists of 
two subsystems A and A’ which can in- 
teract to a small extent and thus ex- 
change energy with each other. System 
A consists of three spins 4, each having 
ἃ magnetic moment po. System A’ con- 
sists of two spins }, each having a mag- 
netic moment 2μο. The system Α΄ is 
located in an applied magnetic field B. 
We shall denote by M the total magnetic 
moment of A along the direction of B, 
and by M’ the total magnetic moment of 
A’ in this direction. The interaction be- 
tween the spins is assumed to be almost 
negligible. The total energy E* of the 
entire system A* is then given by 


E* = —(M+M")B. 


is equal to —2oB. The system can then 
be found in any one of the following four 
states accessible to it: 


{+++-}, 
{+-++4+} 


{++-+). 
{-=+++}. 


The system A* consists of 5 spins and 
thus has a total of 25 = 32 possible 
quantum states. Each of these can be 
labeled by five quantum numbers, the 
three numbers συ, 02, os specifying the 
orientations of the three magnetic mo- 
ments of A, and the two quantum num- 
bers σή, σῷ specifying the orientations of 
the two magnetic moments of A’. Sup- 
pose that the isolated system A* is known 
to have a total energy E* equal to — 3yoB. 
Then A* must be found in any one of the 
five accessible states, listed in Table 3.3, 
which are compatible with this total 
energy. 


} Any system which is not isolated can then be treated as a part of a larger system which | 


is isolated. 


wow 


mw 
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The task of giving a statistical description of a macroscopic system 
can now be formulated very precisely. In a statistical ensemble of 
such systems, every system is known to be in one of its accessible 
quantum states. We should then like to predict the prebability that 
the system is found in any given one of these accessible states. In 
particular, various macroscopic parameters of the system (e.g,, its total 
magnetic moment or the pressure which it exerts) have values which 
depend on the particular quantum state of the system. Knowing the 
probability of finding the system in any one of its accessible states, we 
should then be able to answer the following questions of physical in- 
terest: What is the probability that any parameter of the system 
assumes any specified value? What is the mean value of such a 
parameter? What is its standard deviation? 


3.3 Statistical Postulates 


In order to make theoretical predictions concerning various probabili- 
ties and mean values, we must introduce some statistical postulates. 
Let us, therefore, consider the simple case of an isolated system (with 
given external parameters) whose energy is known to lie in a specified 
small range between E and E + δὲ. As already mentioned, this sys- 
tem can be found in any one of a large number of[accessible states||| 
Focusing attention on a statistical ensemble of such systems, what can 
we say about the probability of finding the system in any one of these 
accessible states? 

To elucidate this question, we shall exploit some simple physical 
arguments similar to those used in Secs. 1.1 and 1.2. There we con- 
sidered the example of an ideal gas and talked about the distribution 
of molecules in a box over their possible positions in space. Analog- 
ously, our more abstract general arguments will now deal with the 
distribution of systems in an ensemble over their accessible states. 
Our discussion will then readily lead to the formulation of general 
postulates suitable as a basis for a statistical theory. 

Let us examine first the simple situation where the system under 
consideration is known at some time to have equal probability of being 
found in each one of its accessible states. In other words, we consider 
the case where the systems in the statistical ensemble are known at 
some time to be uniformly distributed over all their accessible states. 


* What happens then as time goes on? A system in a given state will, 


of course, not remain there forever; as we pointed out at the end of 
Sec. 3.1, it will continually make transitions between the various states 
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accessible to it. The situation is thus a dynamic one. But there is 
nothing intrinsic in the laws of mechanics which gives preference to 
any one of the accessible states of a system compared to any other one. 
Thus, contemplating the ensemble of systems as time goes on, we do 
not expect that the number of systems in some particular subset of the 
accessible states will become less while that in some other subset will 
become greater.} Indeed, the laws of mechanics can be applied to 
an ensemble of isolated systems to show explicitly that, if the systems 
are initially uniformly distributed over all their accessible states, then 
they will remain uniformly distributed over these states forever.{ Such 
a uniform distribution remains, therefore, unchanged in time. 


Example 


To give a very simple illustration, we may 
return to Example (i) of Sec. 3.2. There 
we dealt with an isolated system of four 
spins having a total energy —2jioB. Sup- 
pose that this system is known at some 
time to have equal probability of being 
found in each one of its four accessible 
states 

(F442). AFtSh), 

(t=); sot) 


In accord with our preceding arguments, 
there is nothing intrinsic in the laws of 


mechanics which would give preferred 
status to one of these four states com- 
pared to any other one. Hence, we do not 
expect that the system would at some 
later time be more likely to be found in 
one particular state, say the state 
{+++ —}, rather than in any other one 
of its four accessible states. The con- 
templated situation, therefore, does not 
change in time. Hence the system will 
continue to have equal probability of 
being found in each one of its four acces- 
sible states. 


The preceding argument thus leads to the following conclusion 
about an ensemble of isolated systems: If the systems in such an en- 
semble are uniformly distributed over their accessible states, the 
ensemble is time-independent. In terms of probabilities, this state- 
ment can be phrased as follows: If an isolated system has equal proba- 
bility of being found in each one of its accessible states, then the 
probability of finding the system in each one of its states is time- 
independent. 


+ This argument is merely a more general version of that used in Sec. 1.1 where we con- 
sidered an ideal gas. When the molecules of the gas are initially distributed uniformly 
throughout a box, they are not expected tg) concentrate themselves preferentially in one 
part of the box as time goes on since there is nothing in the laws of mechanics which gives 
preference to one part of the box compared to any other. 

1 This result is a consequence of the so-called “Liouville theorem.” Proofs of this theo- 
rem, which require a knowledge of advanced mechanics far beyond the level of this book, 
can be found in R. Ὁ. Tolman, The Principles of Statistical Mechanics, chaps. 3 and 9 
(Oxford University Press, Oxford, 1938). 


ms 
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An isolated system, by definition, is said to be in equilibrium if the 
probability of finding the system in each’ one of its accessible states is 
independent of time. In this case the mean value of every measurable 
macroscopic parameter of the system is, of course, also independent 
of time.+ In terms of this definition of equilibrium, the conclusion of 
the preceding paragraph can be summarized by the following state- 


ment: 


| If an isolated system is found with equal probability in an 


each one of its accessible states, it is in equilibrium. 


Let us next examine the general case where the isolated system 
under consideration is known at some initial time to be in some subset 
of the states actually accessible to it. A statistical ensemble of such 
systems at this time would then contain many systems in some subset of 
their accessible states, and no systems at all in the remaining accessible 
states. What happens now as time goes on? As we mentioned before, 
there is nothing intrinsic in the laws of mechanics which would tend 
to favor some of the accessible states over others. By their definition, 
the accessible states are also such that the laws of mechanics impose 
no restrictions preventing the system from being found in any one of 
these states. It is, therefore, exceedingly unlikely that a system in the 
ensemble remains indefinitely in the subset of states in which it finds 
itself initially and avoids the other states which are equally accessible 
to it.{ Indeed, by virtue of small interactions between its constituent 
particles, the system will in the course of time continually make tran- 
sitions between all of its accessible states. As a result, each system in 
the ensemble will ultimately pass through essentially all the states in 
which it can possibly be found. The net effect of these continual tran- 
sitions is analogous to that resulting from the repeated shuffling of a 
deck of cards. If the shuffling is kept up long enough, the cards get 
so mixed up that each one is equally likely to occupy any position in 
the deck irrespective of how the deck was arranged initially. Similarly, 
in the case of the ensemble of systems, one expects that the systems will 


} Indeed, to ascertain experimentally that a system is in equilibrium, one verifies that the 
mean values of all its observable macroscopic parameters are independent of time. One 
then presumes that the probability of finding the system in any one state is independent 
of time so that the system is in equilibrium. 

} The present discussion in terms of quantum states and transitions between them is 
again merely a generalization of the arguments applied in Sec 1.2 to the case of an ideal 
gas. When all the molecules of the gas are in the left half of a box, the resulting situation 
is very unlikely and the molecules become quickly distributed throughout the entire box. 
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ultimately become uniformly (i.e., randomly) distributed among all 
their accessible states.} Once this situation has been attained, the 
distribution remains uniform in accordance with (17). This final situa- 
tion thus corresponds to a time-independent equilibrium situation. 

The conclusion of the preceding argument can then be summarized 
as follows: 


If an isolated system is not found with equal probability | “= 
in each one of its accessible states, it is not in equilib- 
rium. It then tends to change in time until it attains 
ultimately the equilibrium situation where it is found 
with equal probability in each one of its accessible states. 


(18) 


Note that this conclusion is analogous to the statement (1.7) of Chap. 1. 
It merely formulates in more precise and general terms the tendency 
of an isolated system to approach its most random situation. 


Example 


The preceding comments can again be 
illustrated with our preceding example of 
the isolated system of four spins. Sup- 
pose that this system has been prepared 
in such a way that it is initially known to 
be in the state {+ +4-+—]}. The tota! en- 
ergy of the system is then —2yoB and 


esses occur in which one moment flips 
from the ‘‘up" direction to the ‘‘down” 
direction while another moment does the 
reverse (the total energy remaining, of 
course, unchanged). As a result of any 
such process, the system goes from 
some initial accessible state to some 


remains constent at this value. There other accessible state. After many re- 

are, however, three other states peated transitions of this kind, the sys- 
tem will ultimately be found with equal 

{++-+} {4+-++4), (—+++) probability in any one of its four acces- 

which also have this energy and are thus sible states 

equally accessible to the system. In- 

deed, as a result of small interactions feces ie tht), 

between the magnetic moments, proc- {+-++),  {-+++)- 


We shall adopt the statements (17) and (18) as the fundamental 
postulates of our statistical theory. Both (17) and (18) can actually 
be deduced from the laws of mechanics, (17) rigorously and (18) with 
the aid of some assumptions. The statement (18) is of special impor- 
tance since it leads, in particulars to the following assertion: 


} With some assumptions inherent in a statistical description, this expectation can be 
deduced from the laws of mechanics as a consequence of the so-called “H-theorem.” A 
simple proof of this theorem and further references can be found in F. Reif, Fundamentals 
of Statistical and Thermal Physics, Appendix A.12 (McGraw-Hill Book Company, New 
York, 1965). 
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If an isolated system is in equilibrium, it is found with (19) 
equal probability in each one of its accessible states. 


This statement is the converse of (17). The validity of (19) follows 
immediately from (18). Indeed, if the conclusion of (19) were false, 
(18) implies that the premise of (19) would be violated. 

The statistical situation of greatest simplicity is clearly that which 
is time-independent, ie., that involving an isolated system in equilib- 
rium. In this case, the statement (19) makes an|unambiguous assertion 
about the probability of finding the system in each one of its accessible 
states. The statement (19) is, therefore, the fundamental postulate 
upon which we can erect the entire theory of macroscopic systems in 
equilibrium. This fundamental postulate of equilibrium statistical me- 
chanics is sometimes called the postulate of equal a priori probabilities. 
Note that this postulate is exceedingly simple. Indeed, it is completely 
analogous to the simple postulate (assigning equal probabilities to head 
or tail) used in discussing the experiment of tossing a coin. The ulti- 
mate validity of the postulate (19) can only be established, of course, by 
checking its predictions against experimental observations. Since a 
large body of calculations based on this postulate have yielded results 
in very good agreement with experiment, the validity of this postulate 
can be accepted with great confidence. 

The theoretical problems become much more complex when we 
are dealing with a statistical situation which does change with time, 
i.e. with a system which is not in equilibrium. In this case the only 
general statement which we have made is embodied in (18). This 
postulate makes an assertion about the direction in which the system 
tends to change (i.e., this direction is such that the system tends to 
approach the equilibrium situation of uniform statistical distribution 
over all accessible states). The postulate provides no information, 
however, about the actual time required for the system to attain its 
ultimate equilibrium condition (the so-called relaxation time). This 
time might be shorter than a microsecond or longer than a century, 
depending on the detailed nature of the interactions between the 
particles of the system and on the resultant frequency with which tran- 
sitions actually occur between the accessible states of this system. A 
quantitative description of a nonequilibrium situation may thus be- 
come quite difficult since it requires a detailed analysis of how the 
probability of finding a system in each of its states changes with time. 
On the other hand, a problem involving a system in equilibrium merely 
requires use of the simple postulate (19) of equal a priori probabilities. 
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Fig. 3.4 Sudden expansion of a gas. (a) Ini- 
tial situation. (b) Situation immediately after 
the piston has been moved. (c) Final situation. 


116 Statistical Description of Systems of Particles 


Remarks concerning the applicability of equilibrium arguments 


It is worth pointing out that the idealized 
concept of equilibrium is, in practice, a 
relative one. The important comparison 
is always between the relaxation time 7, 
(the characteristic time necessary for the 
system to attain equilibrium when it is 
not in equilibrium initially) and the time 
τε of experimental! interest in a given 
discussion. 

Imagine, for example, that when the 
piston in Fig. 3.4 is suddenly pulled to 
the right, it takes about 10-3 seconds be- 
fore equilibrium is attained and the whole 
gas is uniformly distributed throughout 
the entire volume of the box. Thus 
1, ~ 10-3 sec. Suppose now that, in- 
stead of doing the preceding experiment, 
we proceed as in Fig. 3.5 by moving the 
piston to the right quite slowly so that 
it takes a time τ, = 100 sec to complete 
the task. Strictly speaking, the gas is 
not in equilibrium during this time since 
its volume is changing. But since 
το» Tr, the molecules at any instant 
have had sufficient time to become es- 
sentially uniformly distributed through- 
out the entire volume available to them 
at that instant. The situation in the gas 
would then remain effectively unchanged 


if one imagined the piston to be really 
stopped at any instant. In practice, the 
gas can thus be considered to be in equi- 
librium at all times. 

As an example of the opposite limit 
where τε 4 τι, consider a piece of iron 
which is rusting at a very slow rate; in a 
time 1,= 100 years it would be com- 
pletely transformed into iron oxide. 
Strictly speaking, this again is not an 
equilibrium situation. But during a time 
το of experimental interest, say τ, = 2 
days, the situation would be effectively 
unchanged if one imagined rusting was 
prevented (e.g., by removing all the 
surrounding oxygen gas). The iron 
would then be in strict equilibrium. 
Thus, in practice, the piece of iron can 
again be discussed by equilibrium argu- 
ments. 

It is, therefore, only in the case when 
τι το τε (i.e., when the time of experimen- 
tal interest is comparable to the time 
required to reach equilibrium) that the 
time-dependence of the system is of es- 
sential significance. The problem is then 
more difficult and cannot be reduced to 
a discussion of equilibrium, or near- 
equilibrium, situations. 


3.4 Probability Calculations 

The fundamental postulate (19) of equal a priori probabilities permits 
the statistical calculation of all time-independent properties of any 
system in equilibrium. In principle, such a calculation is very simple. 
Consider thus an isolated system in equilibrium and denote by & the 
total number of states accessible to it. According to our postulate, 
the probability of finding the system in each one of its accessible states 
is then the same and hence simply equal to 1/Q. (The probability of 
finding the system in a state which is not accessible to it is, of course, 
zero.) Suppose now that we are interested in some parameter y of 
the system; for example, y might be the magnetic moment of the sys- 
tem or the pressure exerted by it. When the system is in any particu- 
lar state, the parameter y assumes correspondingly some definite value. 
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Let us enumerate the possible values which y can assume and desig- 
nate them by ψ1, y2,.--, Yn- Among the Q states accessible to the 
system, there will then be some Q; such states in which the parameter 
assumes the particular value y;. The probability P; that the parame- 
ter assumes the value y; is then simply the probability that the system 
is found among the Q; states characterized by this value y;. Thus P; 
is obtained by summing 1/2 (the probability of finding the system in 
any single one of its accessible states) over the Q; states where y ie) 
assumes the value y;; i.e., P; is merely Q; times as large as the proba- 

bility 1/2 of finding the system in each one of its accessible states. 


Hencet 


i (21) 


where the summation is over all possible values of y. The dispersion Ὁ). 


of y can be calculated in a similar way. ΑἹ] statistical calculations are 
thus basically just as simple as those involved in discussing the tossing 
of a set of coins. i == 


Example (i) ΟῚ »Sa2rf[osa.c5-%4 Gwar aS 


Consider again the system of four spins _ spins, say the first. What is the proba- + 
whose states are listed in Table 3.2, and bility P, that its magnetic moment points 
suppose that the total energy of this sys- up? Since it points up in three of the Ξ 
temis known tobe —2yoB. Ifthesystem four equally likely states accessibletothe ae (c) E 
is in equilibrium, it is then equally likely entire system, this probability is simply : 


to be in each one of the four states aS Ξ 
(ἘΞ -(@+=4}5 oat Fig. 3.5 Very slow (quasi-static) expansion of 
᾿ What is the mean magnetic moment » ἃ gas. (a) Initial situation. (Ρ) Intermediate 
(S-4+4+), +++} of this spin in the direction of the applied situation. (ο) Final situation. 
Focus attention on any one of these field BP Since this moment is su in three 


+ If the possible values of the parameter are continuous rather than discrete, we can Ξ 
always proceed as in Sec. 2.6 and subdivide the domain of the possible values of y into Ἂς 
very small intervals of fixed size δι. Since these intervals can be enumerated and labeled 
by some index i, we may then simply denote by y; the value of the parameter when it lies 
anywhere in the ith such interval. The problem is then reduced to one where the possible ==. 
values of y are discrete and countable. = . 
} The result (20) is so simple because our fundamental postulate asserts that the system 
has equal probability of being found in each one of its accessible states. In an ensemble 
of © systems, the number //; of systems with y = ψι is therefore simply proportional to 
the number {; of states accessible to a system with y=y;. Thus ΡῚ ΞΞ. Ὁ = 2/0. 
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of the accessible states and —po in one 
of the accessible states, this mean value 
is 


Π -- 3H + (=Ho) 
4 


ek: 
= pho 


Note, incidentally, that one given spin 
in this system is not equallyTikely to have 


Example (ii) 


Consider the system of spins discussed 
in Table 3.3. The total energy of this 
system is known to be —3y0B and the 
system is assumed to be in equilibrium. 


8 moment which points up or down; i.e., 
it is not found with equal probability in 
each one of its two possible states. This 
result does not, of course, contradict our 
fundamental statistical postulate since 
such a single spin is not isolated, but is 
part of a larger system where it is free to 
interact and exchange energy with other 
spins. 


possible values 3 or -- μο. The proba- 
bility PQ) that it assumes any of these 
values can then be read off directly from 
the table; thus 


The system is then equally likely to be P(3po) = %, 
found in each one of its 5 accessible 
states. Let us, for example, focus atten- and P(—po) = ὃ. 


tion on the subsystem A consisting of 3 
spins and denote by M its total magnetic 
moment in the direction of the field B. 
It is seen that M can assume the two 


The mean value of M is then given by: 


= 2)3y0) + (3)(—Ho) _ 3 
m= t = 3h 


The preceding examples are extremely simple since they deal with 
systems consisting of very few particles. They do, however, illustrate 
the general procedure used in calculating probabilities and mean values 
for any system in equilibrium, no matter how complex. The only 
difference is that the enumeration of accessible states characterized 
by a particular value of a parameter may become a more difficult task 
in the case of a macroscopic system consisting of very many par- 
ticles. The actual computation may thus be correspondingly more 
complicated. 


3.5 Number of States Accessible to a Macroscopic System 
The preceding four sections contain all the basic concepts needed for 
ἃ quantitative statistical theory of macroscopic systems in equilibrium 
and for a qualitative discussion of their approach to equilibrium. We 
shall use the remainder-of this chapter to become familiar with the 
significance of these concepts and to show how they provide a precise 
description of some of the qualitative notions introduced in Chap. 1. 


These preliminaries will prepare us to elaborate these concepts syste- 


matically throughout the rest of the book. 


,- 
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As we have seen, the properties of ἃ system in equilibrium can be 
calculated by counting the number of states accessible to the system 
under various conditions. Although this counting problem may be- 
come difficult, it can ordinarily be circumvented in practice. As is 
usual in physics, progress may become easy if one strives to gain 
insight instead of trying to compute blindly. In particular, what is 
important in the present instance is to τι ize some of the general ' 
properties exhibited by the number of states accessible to any system 
consisting of very many particles. Since a qualitative understanding 
of these properties and some approximate estimates will be quite ade- 
quate, fairly crude arguments will suffice to achieve our purpose. 

Consider a macroscopic system with given external parameters so 
that its energy levels are determined. We shall denote the total energy 
of this system by E. To facilitate the counting of states, we shall group|) 
these states according to energy by subdividing the energy scale into 
equal small intervals of fixed magnitude 6E. Thus dE is supposed to) 
be very small on a macroscopic scale (i.e., very small compared to the 
total energy of the system and small compared to the contemplated 
precision of any macroscopic measurement of its energy). On the 
other hand, δὲ is supposed to be large on a microscopic scale (i.e., much (ii) 
larger than the energy of a single particle in the system and thus also 
much larger than the separation in energy between adjacent energy 
levels of the system). Any interval δὲ thus contains very many pos- 
sible quantum states of the system. We shall introduce the notation 


Je 


Q(E) = the number of states with energies lying in the 


interval between E and E + 8E. (22) 


The number of states 2(E) depends on the magnitude SE chosenas τα 
the subdivision interval in a given discussion. Since δὲ is macroscopi- 
cally very small, 2(£) must be simply proportional to 6E, i.e., we can 
writet 
QE) = ρ(Ε) 8E (9) 


where p(E) is independent of the size of δὲ. [The quantity ρ(Ε) is 
called the density of states because it is equal to the number of states 
per unit energy range at the given energy E.] Since the interval δὲ 


{The situation here is similar to that encountered in Sec. 2.6 when we discussed con- 
tinuous probability distributions. The number of states Ὡ(Ε) must vanish when δῈ ap- 
proaches zero and must be expressible as a Taylor series in powers of SE. When δὲ is 
sufficiently small, this series reduces to (23) since terms involving higher powers of δῈ are 
then negligible. 
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Fig, 3.6 The points on the line indicate the 
possible values n = 1, 2, 3, 4,... of the quan- 
tum number n specifying the state of a single 
particle in one dimension. The values of n 
corresponding to the energy values E and 
E + δῈ are indicated by the vertical lines. 
The region in light gray includes all the values 
of n for which the energy of the particle is less 
than Ε. The region in dark gray includes all 
the values of n for which the energy lies be- 
tween E and E + 6E. 
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contains very many states, 2(E) changes only by a small fraction of 
itself as one goes from one energy interval to an adjacent one. Hence 
Q(E) can be regarded as a smoothly varying function of the energy E. 
We shall be interested specifically in examining how sensitively 2(E) 
depends on the energy E of a macroscopic system. 

Note, incidentally, that it is possible to obtain 2(E) if one knows 
the quantity 


©(E) = the total number of states having energies less 24 
than E. (24) 


The number (E) of states having an energy lying between E and 
E + δῈ is then simply given by 
Q(E) = ΦΙΕ + δῈ) — O(E) = oe. (25) 
Before discussing the general properties of 2(E) in the case of a 
macroscopic system, it will be instructive to illustrate how the number 
of states 2(E) can be counted in the case of some very simple systems 
consisting of a single particle. 


Example (i) Single particle in a one-dimensional box 


Consider a single particle of mass m free Since successive quantum states corre- 
to move in one dimension in a box of spond to values of n differing by unity, 
length L. The possible energy levels of _ the total number Φ(Ε) of quantum states 


this system are then, by (8), having an energy less than E, or a quan- 
sais tum number less than n, is then simply 
= πὸ = 
E ἀπ 1" (26) equal to(n/1)=n. Thus 
wheren = 1,2,3,4.... Thecoefficient 


L a 
of n? is very small if L is of macroscopic O(E) =n = πε. (86) 


size. The quantum number n is thus 
very large for energies of ordinary inter- 
est.t By (26), the value of n for a given 
energy Eis 


Accordingly, (25) yieldst 


n= FemEy2. (27) Ὧ(Ε) = yom QE-12 8E, (29) 


+ For example, if L = 1 cm and m ~ 5 X 103 gm, the mass of a nitrogen molecule 
given by (1.29), this coefficient is about 1033 erg. But the average energy of such a mole- 
cule at room temperature is, by (1.28), of the order of 1034 erg. Hence (26) yields for n 
a typical value of the order of 10°. 

{Since n is very large, a change in n by unity produces only a negligible fractional ἡ 
change inn or E. The fact that n or E can assume only discrete values is then of negligible 
importance so that these variables can be treated as though they were continuous. In 
differentiating, it is merely necessary to consider any contemplated change in n to be 
always large compared to unity so that dn > 1, but to be very small in the sense that 
ἄπ« π. 
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Example (ii) Single particle in a three-dimensional box 


Consider a single particle of mass m free 
to move inside a three-dimensional box. 
For the sake of simplicity, we assume 
this box to be cubic and of edge length L. 
The possible energy levels of this system 
are then given by (15) with L.=L, = 
1. = L; thus 


ἢ κε 
ἘΞ ἘΞ πε m2) (80) 
where nz, n,, nz = 1, 2, 3,.... In the 


“number space" defined by three per- 
pendicular axes labeled by τις, ny, nz, the 
possible values of these three quantum 
numbers lie thus geometrically at the 
centers of cubes of unit edge length, as 
indicated in Fig. 3.7. Asin the preceding 
example, these quantum numbers are 
again ordinarily very large for a molecule 
in a macroscopic box. By (30) it follows 
that 


ne +n? + 


For a given value of E, the values of 
Nz, Ny, τις Which satisfy this equation lie 
on a sphere of radius R in Fig. 3.7. Here 


L 
na 1/2 
(2mE)v2, 


The number Φ(Ε) of states with energy 
less than E is then equal to the number 
of unit cubes lying within this sphere and 
having positive values of n., ny, and nz; 
i.e., it is simply equal to the volume of 
one octant of the sphere of radius R. 
Thus 


wn =3(fon)=5 (Gm 


By (25), the number of states with energy 
between E and E + δὲ is then 


QE) = saa emme ΔῈ (88) 


where V = L’ is the volume of the box. 


\eme) Ξε ᾿Ξ. 


Let us now make a crude order-of-magnitude estimate to find 
roughly how the number of states 2(E) or, equivalently ®(E), depends 
on the energy E of a macroscopic system of particles. Any such sys- 
tem can be described by some set of f quantum numbers where f, the 
number of degrees of freedom of the system, is of the order of Avo- 
gadro’s number. Associated with each quantum number there is a 
certain contribution ε to the total energy E of the system. We shall 
denote by g(E) the total number of possible values of this quantum 
number when its associated energy is less than «. The number 9 is 
then unity (or of the order of unity) when ε has its lowest possible 
value €o, and must clearly increase as € increases (although it may ap- 
proach a constant value in an exceptional case).f Ordinarily we 
expect φ to increase roughly proportionately to the energy interval 
(ε — ε). Hence we can write the approximate relation that 


+ This exceptional case arises if a system has only a finite total number of possible states 
and thus also an upper bound to its possible energy. (This can occur only if one ignores 
the kinetic energies of the particles in a system and considers only their spins.) There- 
fore, after at first increasing as a function of (ε — 9), the quantity p must in this case ulti- 
mately become constant. 


E+6E 


Fig. 3.7 The points indicate schematically 
in two dimensions the possible values of 
Nz, Ny, Nz = 1,2,3,4,...of the quantum num- 
bers specifying the state of a single particle in 
three dimensions. (The n, axis points out of 
the paper.) The values of nz, n,, and n, corre- 
sponding to the energy values E and E + 6E 
lie on the indicated spherical surfaces. The 
region in light gray includes all values of n for 
which the energy of the particle is less than E. 
The region in dark gray includes all the values 
of for which the energy lies between E and 
ESE. 


οι 
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ordinarily, ge) & (€ — €0)* where a ~ 1, (33) 


1.6., where ἃ is some number of the order of unity.t 

Consider now the entire system with f degrees of freedom. Its 
total energy E (the sum of the kinetic and potential energies of all the 
particles contained within the system) is the sum of the energies asso- 
ciated with all its degrees of freedom. Hence the value of the energy 
E (in excess of its lowest possible value Eo) should be roughly f times 
as large as the average energy ε per degree of freedom (in excess of 
its lowest possible value €o). Thus 


Ε-- Eo~ f(e- ον. (34) 


Corresponding to a total energy of the system of E or less, there are 
then approximately @(¢) possible values which can be assumed by the 
1st of its quantum numbers, φίε) possible values which can be assumed 
by the 2nd of its quantum numbers, . . . , and φίε) possible values which 
can be assumed by the fth of its quantum numbers. The total number 
of possible combinations of these quantum numbers, i.e., the total 
number Φ(Ε) of states with total energy less than E, is then obtained 
by multiplying {the number of possible values of the 1st quantum 
number} by {the number of possible values of the 2nd quantum num- 
ber}, then multiplying by {the number of possible values of the 3rd 
quantum number}, and finally multiplying by {the number of possible 
values of the fth quantum number}. Thus 


(E) ~ [96]! (35) 


where ε is related to E by (34). The number 2(E) of states with energy 
between E and E + 6E is then given by (25). Thus 


QUE) = ἘΣ 8E -- fol ag δῈ = gf de 8E (36) 
since dp/dE = [1 (dp/de) by (34). 


The preceding approximate considerations are quite sufficient to 
lead to some remarkable conclusions based on the fact that f is a very 
large number. Indeed, since we are dealing with a macroscopic sys- 
tem, f is of the order of Avogadro’s number, i.e., f ~ 1074. Numbers 
of this magnitude are so fantastically large that their peculiar prop- 
erties are unlikely to bg familiar from everyday experience. 

As the energy E of the system increases, so does the energy € per 
degree of freedom [see Eq. (34)]. Correspondingly, the number of 


+ For example, in the case of the quantum number describing the motion of 2 single 
particle in one dimension, φ & €¥/2 by virtue of (28). (The lowest possible energy ¢o there 
is negligible compared to ε and thus is essentially zero.) 
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states φίε) per degree of freedom increases relatively slowly. But, 
since the exponents in (35) or (36) are of the order of f and thus ex- 
tremely large, the number of states ®(E), or Q(E), of the system with 
1 degrees of freedom increases at a fantastic rate. Hence we arrive 
at the following conclusion: 


+ 


The number of states Q(E) accessible to any ordinary 
macroscopic system is an extremely rapidly increasing (37) 
function of its energy E. 


Indeed, combining (36) with (33) and (34), we obtain for the depend- 
ence of 2 on E the approximate relations 


QUE) cx (€ = ερ)ο7- cx G5)". 


Thus we can assert that, 


for any ordinary system, (38) 
Q(E) « (E — ΕΟΥ̓ approximately. ἢ 


Here we have neglected 1 compared to f and have simply put a = 1 
since (38) is intended to indicate only very crudely the dependence 
of 2 on E. It matters little, for that purpose, whether the exponent 
in (38) is f, $f, or any other number of the order of of 

We can also make some statements about the magnitude of In Q. 
By (36) it follows that 


In 9) = (f= 1)Ing +n (az). (39) 


Let us now make the general observation that, if one deals with any 
large number such as f, its logarithm is always roughly of the order of 
unity and thus utterly negligible compared to the number itself. For 
example, if f = 1024, Inf = 55; thus Inf<f. Consider then the 
terms on the right side of (39). The first term is of the order of ft 


+ The qualifying phrase for any ordinary system is intended to exclude the exceptional 
case (mentioned in the footnote on page 121) where the kinetic energy of the par- 
ticles in a system is ignored and where their spins have sufficiently high magnetic energy. 
(Focusing attention on the spins alone may be a justifiable approximation if the transla. 
"onal motion of the particles has only small effect on the orientations of their spins. ‘The 
spin orientations and translational motion of the particles can then be treated separately.) 

1 This is always true unless the energy E of the system is very close to its ground state 
energy Eo when φ ~ 1 for all degrees of freedom. Indeed, we know from our general 
comments at the beginning of Sec. 3.1 that, when a system approaches its ground state 
energy, the number of quantum states accessible to it is of order unity so that 2 ~ 1. 


int 
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The quantity (dp/de) δῈ (where δῈ is any energy interval which is 
large compared to the spacing between the energy levels of the system) 
represents the number of possible values of a single quantum number 
in the interval δὲ and depends on the size of SE. But irrespective of 
the size chosen for δὲ, even the most conservative estimate would lead 
us to expect that this quantity (dp/de) 5E lies somewhere between 
land 1019, Its logarithm, however, then lies between 0 and 230 and 
is thus utterly negligible compared to f which is of the order of 1074. 
Hence the second term on the right of (39) is utterly negligible com- 
pared to the first. We thus arrive at the following conclusions: 


In the case of a macroscopic system, the number of 
states Q(E) having an energy between E and E + δὲ is 
to excellent approximation such that 


for E ¢ Eo, In Q(E) is independent of δὲ; (40) 
In QE) ~ ἢ (41) 


In words, (40) and (41) assert that, as long as the energy E is not very 
close to its ground state value, In Ὡ is independent of the magnitude 
of the chosen subdivision interval δὲ and is of the order of the number 
of degrees of freedom of the system. 


3.6 Constraints, Equilibrium, and Irreversibility 

Let us now summarize the general point of view which we have devel- 
oped and which we shall apply repeatedly to discuss any situation 
involving macroscopic systems. The starting point for all our consid- 
erations is an isolated system.t Such a system is known to satisfy 
certain conditions which can be described on a macroscopic scale by 
specifying the value of some macroscopic parameter y of the system 
(or the values of several such parameters). These conditions act as 
constraints restricting the states in which the system can be found to 
those compatible with these conditions, i.e., to what we have called 
the accessible states of the system. The number Ὁ) of these accessible 
states depends thus on the constraints to which the system is known 
to be subject so that Ὁ) =42(y) is some function of the specified macro- 
scopic parameter of the System. 


+ Any system which is not isolated can always be treated as part of a larger system which 
is isolated. 
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Example 


To keep in mind a specific illustration, 
consider the familiar system shown in 
Fig. 3.82. This system consists of an 
ideal gas contained within the volume V; 
constituting the left part of a box. The 
right part of the box is empty. Here the 
Partition subdividing the box into the two 
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parts acts as a constraint which restricts 
the accessible states of the gas to those 
where all its molecules are located in the 
left part of the box. Hence the number 
Q of accessible states of the gas depends 
‘on the volume Vj of this left part, i.e., 
2= AV). 


The statistical description of the system involves probability state- 
ments about an ensemble of such systems all of which are subject to 
the same specified constraints. If the system is in equilibrium, it is 
found with equal probability in each one of its 2 accessible states, and 
conversely. If the system is not found with equal probability in each 
one of its @ accessible states, then the statistical situation is not inde- 
pendent of time.} The system then tends to change until it ultimately 
attains the equilibrium situation where it is found with equal proba- 
bility in each one of its 2 accessible states. The previous statements 
represent, of course, merely the content of our fundamental postulates 
(18) and (19). 

Consider an isolated system initially in equilibrium under conditions 
where Q; states are accessible to it. The system is then found with 
equal probability in each one of these states. Suppose that some of 
the original constraints on the system are now removed. (For instance, 
in the example of Fig. 3.8, suppose that the partition is now removed.) 
Since the system is then subject to fewer restrictions than before, the 
number of states accessible to the system afterward certainly cannot 
be less than before; indeed, it will ordinarily be very much greater. 
Denoting by ὧδ, this final number of states in the presence of the new 
constraints, we can thus write 


2, > 5. (42) 


Immediately after the original constraints are removed, the proba- 
bility of finding the system in any one of its states will be the same as 
before. Since the system had initially equal probability of being in 
each one of its 2; accessible states, it will thus, immediately after re- 
moval of the original constraints, still be found with equal probability 
in each one of these Ὡ; states. Two cases can then arise: 


In other words, in the ensemble of systems, the probability of finding the system in a 
given state changes with time for at least some of the states. 


Fig. 3.8 An ideal gas confined within a box. 
{a) Initial situation where the gas is confined 
‘hy a partition within the left volume V; of the 
box. (6) Final situation, a long time after the 
Partition is removed, when the gas is confined 
merely within the entire volume V; of the box. 
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tion of the ensemble of systems cannot be restored merely by reim- 
posing the former constraint while keeping the system isolated (i.e., 
while preventing the system from interacting with any other system 
with which it can exchange energy). 

Of course, if we focus attention on a single system of the ensemble, 
its initial situation might be restored if we waited long enough for the 
occurrence of the right kind of spontaneous fluctuation. If this fluc- 
tuation is such that the system at a particular instant of time is found 
only among the Ὡ; states originally accessible to it, then we could at 
that instant reimpose the original constraint and thus restore the sys- 
tem to its initial situation. But the probability of occurrence of such 
a fluctuation is ordinarily extremely small. Indeed, if we consider an 
ensemble of systems in equilibrium after the constraint has been re- 
moved, the probability P; of encountering in the ensemble a system 
which is found among only Q; of its states is 


R= (44) 


since there are Q; states accessible to the system. The probability of 
encountering, among repeated observations of a single system, an ob- 
servation corresponding to the desired fluctuation is then also given 
by (44). But in the usual case where Q; >> Q; [so that the final ensem- 
ble of systems differs very substantially from the initial one], (44) shows 
that spontaneous fluctuations allowing one to restore the initial situa- 
tion of a single system occur exceedingly rarely. 

We shall say that a process is irreversible if the initial situation of an 
ensemble of isolated systems having undergone this process cannot be 
restored by simply imposing a constraint. In accordance with this 
definition, the process in which an isolated system attains a new equi- 
librium situation after one of its constraints has been removed (thus 
changing the number of states accessible to it from Q; to ©) is irre- 
versible if 2; > Q;. Equivalently, our definition implies that a process 
is irreversible if an isolated system after this process has probability 
less than unity of being found in its original macrostate; indeed, in the 
usual case (where 2; > ;) this probability is utterly negligible. Our 
present definition of irreversibility is thus merely a more precise for- 
mulation of that given in Sec. 1.2 in terms of the fluctuations in time of 
a single isolated system. 

We can now also make more quantitative the notion of randomness 
introduced in Chap. 1. As a statistical measure of the degree of ran- 
domness of a system we can take the number of its accessible states 
actually occupied in an ensemble of such systems. The process of 


Fig. 3.9 Two systems A and A’ with fixed 
external parameters and free to exchange en- 
ergy. The combined system Α΄, consisting of 
A and A’, is isolated. 
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attaining a new equilibrium situation after removing a constraint of an 
isolated system thus results in increasing the randomness of the system 
if 2; > Q,; the process is correspondingly irreversible. 


Example 


Once the gas in our previous example 
has attained its final equilibrium situa- 
tion so that the molecules are essentially 
uniformly distributed throughout the en- 
tire box, the simple act of replacing the 
partition does not restore the original 
situation of an ensemble of such boxes. 
The molecules now located in the right 
part of the box still remain there. The 
process ensuing when the partition is re- 
moved is thus irreversible. 

To examine possible fluctuations that 
might restore the original situation in 
a particular box of gas in the ensemble, 


let us examine the probability P; that this 
box is found with all its molecules on its 
left side after the final equilibrium situa- 
tion has been attained. By virtue of (43) 
and (44), this probability is given by 


(45) 


and is thus fantastically small if V; > Vi 
and if Nis at all large.t The removal of 
the partition thus illustrates a typical ir- 
reversible process in which the random- 
ness of the gas is increased. 


The general point of view of this section can be illuminated usefully 
by two further examples which are prototypes illustrating the inter- 
action between macroscopic systems. 


Example (i) 


Consider an isolated system A* which 
consists of two subsystems A and A’ 
with fixed external parameters. (For in- 
stance, A and A’ might be a piece of 
copper and a block of ice, respectively.) 
Suppose that A and A’ are separated 
from each other in space so that they 
cannot exchange energy with each other. 
The system A* is then subject to a con- 
straint requiring that the energy E of A 
and the energy E’ of A’ must each sepa- 
rately remain constant. Accordingly, the 
states accessible to the total system A* 
are only those consistent with the condi- 
tions that A has some constant specified 
energy Εἰ and that A’ has some constant 
specified energy Ej. If there are 2° such 
states accessible to A* and if A* is ih 
equilibrium, then A* is found with equal 


probability in each one of these states. 

Imagine that the systems A and A’ 
are now placed in contact with each other 
so that they are free to exchange energy. 
Then the former constraint has been re- 
moved since the energies of A or A’ need 
no longer be separately constant; only 
the sum (E + E’) of their energies, i.e., 
the total energy of the combined system 
A*, must remain constant. As a result 
of removing the constraint, the number 
of states made accessible to A* ordinarily 
becomes much greater, say, equal to 27. 
Therefore, unless Q7 7. the system 
At is not in equilibrium immediately after 
A and A’ are placed in contact with each 
other. The energies of the systems A 
and A’ will, therefore, change (energy in 
the form of heat passing from one to the 


} Note that in the special case where the box is originally divided by the partition into 
two halves, V; = 2V; so that P; = 2-®. Thisis merely the result (1.1) obtained in Chap. 1 


by primitive reasoning. 


I 


wo 
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other) until A* attains its final equilib- 
rium situation where it is found with 
equal probability in each one of the 27 
states now accessible to it. 

Suppose that the systems A and A’ 
are now separated again from each other 
so that they can no longer exchange en- 
ergy. Although the former constraint 


Example (ii) 


Consider an isolated system A* consist- 
ing of two gases A and A’ separated by 
a piston that is clamped in position. The 
piston then acts as a constraint requiring 
that the states accessible to Α΄ be only 
those consistent with the conditions that 
the molecules of gas A lie within some 
fixed volume V; and that the molecules 
of gas A’ lie within some fixed volume 
V;. If there are ὩΪ such states accessible 
to A* and if A* is in equilibrium, then A* 
is found with equal probability in each 
one of these states. 

Imagine that the piston is now un- 
clamped so that it is free to move. The 
individual volumes of the gases A and A’ 
then no longer need be separately con- 
stant. Consequently, the number of 
states accessible to A* becomes ordinar- 
ily much greater, say, equal to QF. Un- 
less 27 = OF, the system A’ is thus not 
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has thus been restored, the initial situa- 
tion of A* has not been restored (unless 
ὩΣ = 9). In particular, the mean ener- 
gies of A and A’ in the ensemble are now 
different from their initial values E; and 
Ej. The preceding process of heat trans- 
fer between the systems is thus irre- 
versible. 


in equilibrium after the piston is un- 
clamped. Hence the piston will tend to 
move and the volumes of A and A’ will 
correspondingly change until A* attains 
its final equilibrium situation where it is 
found with equal probability in each one 
of the Q7 states now accessible to it. As 
would be expected (and as will be shown 
explicitly in Chap. 6), the final volumes 
of the gases A and A’ are then such that 
their mean pressures are equal, thus 
guaranteeing that the piston is indeed 
in mechanical equilibrium when it is in its 
final position. 

The preceding process is again clearly 
irreversible if 27 > Qj. Leaving A®* iso- 
lated while simply clamping the piston 
again, so that it is not free to move, does 
not restore the initial volumes of the 
gases. 


3.7 Interaction between Systems 


The preceding two examples illustrate specific instances where mac- 
roscopic systems interact with each other. Since the study of such 
interactions is of central importance,+ we shall end this chapter by 
examining explicitly the distinct ways in which macroscopic systems 
can interact. 

Consider two macroscopic systems A and A’ which can interact and 
thus exchange energy with each other. The combined system A*, 
consisting of A and A’, is then an isolated system whose total energy 
must remain constant. To describe the interaction between A and A’ 
in statistical terms, we consider an ensemble composed of a very large 


} Indeed, the whole discipline of thermodynamics, as the name indicates, deals with the 
macroscopic analysis of thermal and mechanical interactions and the macroscopic conse- 
quences that can be derived therefrom. 


Fig. 3.10 Two gases A and A’ separated by a 


movable piston. 


The combined system A*, 


consisting of A and A’, is isolated. 


Fig. 3.11 An ensemble of systems A*, each 
consisting of two systems A and A’ capable of 
interacting with each other. 

ye 
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number of systems similar to A*, each consisting of a pair of interacting 
systems A and A’, The process of interaction between A and A’ does 
not ordinarily result in precisely the same energy transfer between A 
and A’ for each such pair of systems in the ensemble. We can, how- 


ever, meaningfully discuss the probability that an energy transfer of 


any specified magnitude results from the interaction process; or, more 
simply, we can merely ask for the mean energy transfer resulting from 
the interaction process. Let us then, in the ensemble of systems under 
consideration, denote the initial mean energies of A and A’ eae the 
interaction process by E; and Ej, respectively; and let us denote the 
final mean energies of A and A’ after the interaction process by Ey 
and Ej, respectively. Since the total energy of the isolated system A* 
consisting of A and A’ remains constant, it follows that 


E,+ Ej = Εἰ + Eis (46) 

ie., the conservation of energy implies simply that 
AE + AE’ =0 (47) 
where AE=E,—E; and AE =E;-E; (48) 


denote the changes of mean energy of each of the two systems A 
and A’. 

We can now refine our discussion of Sec. 1.5 by examining syste- 
matically the various ways in which two macroscopic systems A and 
A’ can interact. For this purpose, we shall inquire what happens to 
the external parameters of the systems during the interaction process. + 


Thermal interaction 

The interaction between the systems is particularly simple if all their 
external parameters are kept fixed so that their energy levels remain 
unchanged. We shall call such an interaction process thermal inter- 


“action. [Example (i) at the end of the preceding section provides a 


specific illustration.] The resulting increase (positive or negative) of 
the mean energy of a system is called the heat absorbed by the system 
and is conventionally denoted by the letter Q. Correspondingly, the 
decrease (positive or negative) of the mean energy of a system is called 


{As mentioned at the beginning of Sec. 3.2, an external parameter of a system is a 
macroscopic parameter (such as the applied magnetic field B or the volume V) which affects 
the motion of the particles in the system and thus also the energy levels of this system. 
The energy E, of each quantum state r of a system thus depends ordinarily on all the 
external parameters of the system. 
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the heat given off by the system and is given by —Q. Thus we can 
write 


and 


Q=AE ΟἾΞΞΔΕ’ (49) 


for the heat Q absorbed by system A and the heat Q' absorbed by sys- 
tem A’, respectively.t The conservation of energy (47) implies then 
that 

Ο-͵ 6’ ΞΟ, 
or 9- --΄. 
This last relation asserts merely that the heat absorbed by A must be 
equal to the heat given off by A’. In accord with the definitions 
already introduced in connection with Eq. (1.15), the system absorb- 
ing a positive amount of heat is said to be the colder system, while the 
system absorbing a negative amount of heat (or giving off a positive 
amount of heat) is said to be the warmer or hotter system. 

The characteristic feature of thermal interaction, where all external 
parameters are kept fixed, is that the energy levels of the systems re- 
main unchanged while energy on the atomic scale is transferred from 
one system to the other. The mean energy of one of the systems 
increases then at the expense of the other not because the energies of 
its possible quantum states have changed, but because the system after 
the interaction is more likely to be found in those of F its states which 
have higher energy. 


(50) 


Thermal insulation (or adiabatic isolation) 

Thermal interaction between two systems can be prevented if they 
are properly separated from each other. Two systems are said to be 
thermally insulated or adiabatically isolated from each other if they 
cannot exchange energy as long as their external parameters are kept 
fixed.{ Thermal insulation can be achieved by separating the systems 
sufficiently in space, or can be approximated by separating them from 
each other by a sufficiently thick partition made of a suitable material 
(such as asbestos or fiberglass). Such a partition is said to be thermally 
insulating or adiabatic if any two systems separated by it are found 


+ Note that, since our present discussion has been more careful in its use of statistical 
concepts than that of Sec. 1.5, we have now defined heat in terms of the change of mean 
energy of a system. 

4 The word adiabatic, meaning “heat is not able to go through,” comes from the Greek 
word adiabatikos (a, not + dia, through + bainein, to go). We shall always use the word 
in this sense, although it is sometimes also used in physics to denote a different concept. 
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Fig. 3.12 The effect of thermal interaction 
ona very simple system A consisting of a single 
spin ὁ having a magnetic moment Ho and lo- 
cated in a magnetic field B. The diagram 
shows the two possible energy levels of A. 
These two quantum states are labeled by + 
and — and their corresponding energies are 
denoted by E, and Ε΄. The probability of 
finding A in a given state is denoted by P, and 


.P_. respectively, and its magnitude is indicated 


graphically by the length of the gray bar. The 
energy levels remain unchanged since the 
applied magnetic field (the one external pa- 
rameter of this system) is supposed to be fixed. 
The initial equilibrium situation (@) is one 


“where the spin is embedded in some solid, 


The solid, with its included spin, is then im- 
mersed in some liquid and one waits until the 
final equilibrium situation (6) is attained. In 
this process the spin system A absorbs heat 
from the system A’ consisting of the solid and 
liquid. If the probabilities change as indicated 
in the diagram, the heat Q absorbed by A is 
equal to O = 0.6uoB. 


ΠΣ 


Fig 3.13 The effect of adiabatic interaction ὃ 
on a very simple system A consisting of a single 


spin ἃ having a magnetic moment pio and lo- + 


cated in a magnetic field B. The initial situa- 
tion and notation is the same as in Fig. 3.12, 
but the spin is then adiabatically isolated. Sup- 
pose that the magnetic field B is changed by 
means of an electromagnet. The amount of 


τ work done depends in general on just how the 


process is carried out. Panel (b) shows the 
final equilibrium situation if the magnetic field 
is changed from B to By very slowly; the work 
W done on A is then W = —0.8p0(B; — B). 


=> Panel (c) shows a final equilibrium situation 


which might result if the magnetic field is 
changed from B to B, in some arbitrary way; 
the work done in the particular case shown 
here is W = —0.4uoB; + 0.8yoB. 
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to be thermally insulated from each other; i.e., if any such systems, 
initially in equilibrium, are found to remain in equilibrium as long as 
their external parameters are kept fixed.t A process which occurs 
while a system is thermally insulated from all other systems is called 
an adiabatic process of this system. 


Adiabatic interaction 6“ 5 Ὁ 
When two systems A and A’ are thermally insulated from each other, 
they can still interact and thus exchange energy with each other, pro- 


+ 


vided that at least some of their external parameters change in the ὕμδοι 


process. We shall call such an interaction process adiabatic interac- 
tion. [Example (ii) at the end of Sec. 3.6 provides a specific illustration, 
if the piston is made of thermally insulating material.] The increase 
(positive or negative) of the mean energy of an adiabatically isolated 
system is called the macroscopic work done on the system} and will 


+ be denoted by W. Correspondingly, the decrease (positive or nega- 


tive) of the mean energy of a system is called the macroscopic work 
done by the system and is given by —W. Hence we can write 


W=AE and W’=AE’ (51) 


for the work done on system A and the work done on system A’, 
respectively. If the total system A + A’ is isolated, the conservation 
of energy (47) implies then that 


W+w=0 (52) 
or Ἅ)Ξ- -Ὑ. 


This last relation asserts merely that the work done on one system 
raust be equal to the work done by the other system. 

Since adiabatic interaction involves changes in some of the external 
parameters of the systems, at least some of the energy levels of these 
systems are changed in the process. The mean energy of one of these 
interacting systems is thus ordinarily changed both because the energy 
ot each of its states changes and because the probability that the sys- 
tem is found in any such state also changes.§ 


{If a partition is not thermally insulating, it is said to be thermally conducting. 

} The macroscopic work, defined in terms of a mean energy difference, is a statistical 
quantity equal to the mean value of the work done on each system in the ensemble. Where 
no confusion is likely to arise, we shall henceforth simply use the term work to denote the 
macroscopic work thus defined. 

§It is worth noting a special case. If a system is in an exact quantum state whose energy 
depends on an external parameter, then the system will simply remain in this state and 
have its energy changed accordingly if the external parameter is changed sufficiently slowly. 
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General interaction 

In the most general case, interacting systems are neither adiabatically 
isolated, nor are their external parameters kept fixed. It is then useful 
to write the total mean energy change of an interacting system, such 


as A, in the additive form 
AE=W+Q (53) 


where W represents the change of mean energy of A caused by changes 
of external parameters and Q represents the change of mean energy 
which is not caused by changes of external parameters. The decom- 
position (53) of AE into work W done on the system and heat Q 
absorbed by the system is meaningful when these contributions can 
be experimentally separated. Thus suppose that the system A inter- 
acts simultaneously with one system ΑἹ which is separated from A by 
a thermally insulating partition, and another system A2 whose external 
parameters are kept fixed. Then the work W in (53) must simply be 
equal to the work done by (or decrease in mean energy of) the system 
ΑἹ which is adiabatically isolated; similarly, the heat Q in (53) must 
simply be equal to the heat given off by (or decrease in mean energy 
of) the system A% whose external parameters are kept fixed. 

The general relation (53) is, for historical reasons, called the first 
law of thermodynamics. It explicitly recognizes work and heat as 
forms of energy transferred in different ways. Since both work and 
heat represent energies, these quantities are, of course, measured in 
units of energy, ie., typically in terms of ergs or joules.} 


Infinitesimal general interaction 

An interaction process is particularly simple if it is infinitesimal in the 
sense that it takes a system from an initial macrostate to a final macro- 
state which differs from the initial macrostate only infinitesimally. ‘The 
energy and external parameters of the system in the final macrostate 
differ then only very slightly from their values in the initial macrostate. 
Correspondingly, the infinitesimal increase in mean energy of the sys- 
tem can be written as the differential dE. In addition, we shall use 
the symbol @W, instead of W, to denote the infinitesimal amount of 
work done on the system in the process; this is merely a convenient 
notation designed to emphasize that the work is infinitesimally small. 
It is important to point out that dW does not designate any difference 


{In the older literature of physics, and much of the present-day literature of chemistry, 
‘one still finds heat measured in terms of an old unit, the calorie, introduced in the eigh- 
teenth century before it was realized that heat is a form of energy. The calorie is now 
defined so that 1 calorie = 4.184 joules exactly. 


4 Count Rumford (born Benjamin 


on, 1753-1814). Born in Massachu- 
I very much of an adventurer, he had 
sympathies during the American Rev- 
ind left America to serve for a time as 

of war to the Elector of Bavaria. 
is attention was attracted by the tem- 
+ rise produced by the boring of can- 
(observation which led him to suggest 
that heat is merely a form of motion of 
icles in a body. Although this was a 
Finsight, it was too qualitative to have 
apact on contemporary thinking which 
2d of heat as a substance (‘‘phlogiston”) 
onserved. (From a portrait in the Fogg 
‘seum, painted in 1783 by T. Gains- 
1, reproduced by courtesy of Harvard 
‘ity.) 


"-» 


134 Statistical Description of Systems of Particles 


between works. Indeed, that would be a meaningless statement. The 
work done is a quantity referring to the interaction process itself; one 
cannot talk of the work in the system before and after the process, or 
of the difference between these. Similar comments apply to dQ which 


* also denotes merely the infinitesimal amount of heat absorbed in a 


not any meaningless difference betwen heats. With this no- 


process, 
tation, the relation (53) for an infinitesimal process can be written in 
the form 

dE = aw + ἀρ. (64) 
Remark 
An infinitesimal process can be dis- dE = S\(P-dE,+E,dP,). (58) 


cussed particularly simply in statistical 
terms if it is carried out quasi-statically, 
ie., so slowly that the system remains 
always very close to equilibrium. Let P, 
denote the probability that the system A 
is in a state r having an energy E,. The 
mean energy of this system is then, by 
definition, 

Ἐ-Ξ:Σ ΡῈ, (55) 

τ 

where the sum extends over all possible 
states r of the system. In an infinitesi- 
mal process the energies Εν change only 
by small amounts as 8 result of changes 
in the external parameters; furthermore, 
if the process is carried out very slowly, 
the probabilities P, also change at most 
by small amounts. Hence the mean en- 
ergy change in the process can be writ- 
ten as the differential 


τ 


The heat absorbed corresponds to the 
increase of mean energy resulting when 
the external parameters are kept fixed, 
ive., when the energy levels E, are kept 
fixed so that dE,=0. Thus one can 
write 

ἀρ => Er dP,. (87) 

τ 

Hence the work done on the system is 
given by 


aw=dE—dQ=> P,dE, (58) 


This infinitesimal work is thus merely 
the mean energy change resulting from 
the shift of the energy levels produced 
by the infinitesimal change of external 
parameters, the probabilities P, retaining 
their initial values appropriate to the 
equilibrium situation. 
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Summary of Definitions 
(Some of these definitions are more precise Ἐν, οι 4s definitions already encountered in 
previous chapters.) 


microstate (or simply state) A particular quantum state of a system. It corresponds to 
the most detailed possible specification of a system described by quantum mechanics. 

macrostate (or macroscopic state) A complete specification of a system in terms of 
macroscopically measurable parameters. 

accessible state Any microstate in which a system can be found without contradicting 
the macroscopic information available about the system. 

number of degrees of freedom The number of distinct quantum numbers necessary to 
describe completely the microstate of a system. It is equal to the number of independ- 
ent coordinates (including spin coordinates) of all the particles in the system. 

external parameter A macroscopically measurable parameter whose value affects the 
motion of the particles in a system and thus the energies of the possible quantum states 
of the system. 

isolated system A system which does not interact with any other system so as to ex- 
change energy with it. 

total energy ofasystem The sum of the kinetic and potential energies of all the particles 
in the system. 

internal energy of a system The total energy of a system measured in the frame of ref- 
erence where its center of mass is at rest. 

equilibrium An isolated system is said to be in equilibrium if the probability of finding 
the system in any one of its accessible states is independent of time. (The mean values 
of all macroscopic parameters of the system are then independent of time.) 

constraint A macroscopic condition to which a system is known to be subject. 

irreversible process A process which is such that the initial situation of an ensemble of 
isolated systems subjected to this process cannot be restored by simply imposing a 
constraint. 

reversible process A process which is such that the initial situation of an ensemble of 
isolated systems subjected to this process can be restored by simply imposing a constraint. 

thermal interaction Interaction in which the external parameters (and thus also the 
energy levels) of the interacting systems remain unchanged. 

adiabatic isolation (or thermal insulation) A system is said to be adiabatically isolated 
(or thermally insulated) if it cannot interact thermally with any other system. 

adiabatic interaction Interaction in which the interacting systems are adiabatically iso- 
lated. In this case the interaction process involves changes in some of the external 
parameters of the systems. 

heat absorbed by a system Increase in mean energy of a system whose external parame- 
ters are kept fixed. 

work done on a system Increase in mean energy of a system which is adiabatically 
isolated. 

cold A comparative term applied to the system which absorbs positive heat as a result 
of thermal interaction with another system. 

warm (or hot) A comparative term applied to the system which gives off positive heat 
as a result of thermal interaction with another system. 


Fig. 3.15 Julius Robert Mayer (1814-1878). 
A German physician, Mayer suggested in 
1842 the equivalence and conservation of all 
forms of energy, including heat. Although he 
made some quantitative estimates, his writing 
was too philosophical to be convincing and his 
work remained unrecognized for about twenty 
years. (From G. Holton and D. Roller, “Foun- 
dations of Modern Physical Science,” Addison- 
Wesley Publishing Co., Inc., Cambridge, Mass., 
1958. By permission of the publishers.) 
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Important Relations 
Relation between mean energy, work, and heat: 


AE=W+Q @) 


Suggestions for Supplementary Reading 
Completely macroscopic discussions of heat, work, and energy: 
M. W. Zemansky, Heat and Thermodynamics, 4th ed., secs. 3.1-3.5, 4.1-4.6 (MeGraw- 


Hill Book Company, New York, 1957). 
H_B. Callen, Thermodynamics, secs. 1.1-1.7 (John Wiley & Sons, Inc., New York, 1960). 


Historical and biographical accounts: 

G. Holton and D. Roller, Foundations of Modern Physical Science (Addison-Wesley Pub- 
lishing Company, Inc., Reading, Mass., 1958). Chapters 19 and 20 contain a historical 
account of the development of the ideas leading to the recognition of heat as a form of 
energy. 

8. Ὁ. Brush, Kinetic Theory, vol. I (Pergamon Press, Oxford, 1965). The historical intro- 
duction by the author and the reprints of original papers by Mayer and Joule may be of 
particular interest. 

8. B. Brown, Count Rumford, Physicist Extraordinary (Anchor Books, Doubleday & Com- 
pany, Inc., Garden City, N.Y., 1962). A short biography of Count Rumford. 


Problems 
81 Simple example of thermal interaction® 

Consider the system of spins described in Table 3.3. Suppose that, when 
the systems A and A’ are initially separated from each other, measurements show 
the total magnetic moment of A to be —3po and the total magnetic moment of 
A’ to be +419. The systems are now placed in thermal contact with each other 
and are allowed to exchange energy until the final equilibrium situation has been 
reached. Under these conditions calculate: 
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(a) The probability P(M) that the total magnetic moment of A assumes any 
one of its possible values M. 

(b) The mean value M of the total magnetic moment of A. 

(c) Suppose that the systems are now again separated so that they are no 
longer free to exchange energy with each other. What are the values of P(M) 
and M of the system A after this separation? 


3.2 One spin in thermal contact with a small spin system 

Consider a system A consisting of a spin having magnetic moment po, 
and another system A’ consisting of 3 spins + each having magnetic moment po. 
Both systems are located in the same magnetic field B. The systems are placed 
in contact with each other so that they are free to exchange energy. Suppose 
that, when the moment of A points up (i.e., when A is in its + state), two of the 
moments of A’ point up and one of them points down. Count the total number 
of states accessible to the combined system A + A’ when the moment of A 
points up, and when it points down. Hence calculate the ratio P_/P,., where 
P_is the probability that the moment of A points down and P,, is the probability 
that it points up. Assume that the total system A + A’ is isolated. 


3.3 One spin in thermal contact with a large spin system 

Generalize the preceding problem by considering the case where the 
system A’ consists of some arbitrarily large number N of spins }, each having 
magnetic moment μο. The system A consists again of a single spin ὁ with mag- 
netic moment μο. Both A and A’ are located in the same magnetic field B and 
are placed in contact with each other so that they are free to exchange energy. 
When the moment of A points up, n of the moments of A’ point up and the 
remaining π' = N — n of the moments of A’ point down. 

(a) When the moment of A points up, find the number of states accessible to 
the combined system A + A’. This is, of course, just the number of ways in 
which the N spins of A’ can be arranged so that n of them point up and n’ of 
them point down. 

(b) Suppose now that the moment of A points down. The total energy of the 
combined system A + A’ must, of course, remain unchanged. How many of 
the moments of A’ now point up, and how many of them point down? Corre- 
spondingly, find the number of states accessible to the combined system A + A’. 

(c) Calculate the ratio P_/P,, where P_ is the probability that the moment 
of A points down and P,, is the probability that it points up. Simplify your result 
by using the fact that n>1 and n’>1. Is the ratio P_/P, larger or smaller 
than unity if n > π 


34 Generalization of the preceding problem 
Suppose that in the preceding problem the magnetic moment of A had 
the value 2uo. Calculate again the ratio P_/P, of the probabilities that this 
τ moment points down or up. 
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3.5 Arbitrary system in thermal contact with a large spin system 

The considerations of the preceding problems can readily be extended 
to treat the following general case. Consider any system A whatsoever, which 
might be a single atom or a macroscopic system. Suppose that this system A is 
placed in thermal contact with a system A’ with which it is free to exchange 
energy. The system A’ is supposed to be located in a magnetic field B and to 
consist of N spins 4, each having a magnetic moment μο. The number N is as- 
sumed to be very large compared to the number of degrees of freedom of the 
relatively much smaller system A. When the system A is in its lowest state of 
energy Eo, we suppose that n of the moments of A’ point up and the remaining 
n’ = N — nmoments of A’ point down. Here n> land n’> 1, since all num- 
bers are very large. 

(a) When the system A is in its lowest possible state of energy Eo, find the 
total number of states accessible to the combined system A + A’. 

(b) Suppose now that the system A is in some other state, call it r, where it 
has an energy E, higher than Eo. In order to conserve the total energy of the 
combined system A + A’, there must then be (n + An) moments of A’ which 
point up, and (n — An) moments of A’ which point down. Express An in terms 
of the energy difference (E, — Eo). You can assume that (E, — Eo) > 1B. 

(c) When the system Avis in the state r with energy E,, find the total number 
of states accessible to the combined system A + A’. 

(ὦ Let Po denote the probability that the system A is in the state of energy 
Eo, and P, the probability that it is in the state r of energy E,. Find the ratio 
P,/Po. Use the approximation that An<n and An «π΄. 

(e) Using the results just derived, show that the probability P, of finding the 
system A in any state r having an energy E, is of the form 


Py = \Ca- 85: 


where C is a constant of proportionality. Express in terms of poB and the 
ratio n/n’. 

(f) Ifn > π΄, is B positive or negative? Suppose that the system A is such 
that its states, labeled by a quantum number r, are equally separated in energy 
by an amount b. (For example, A might be a simple harmonic oscillator.) Thus 
€, =a + br, where r= 0,1,2,3... and a is some constant. Compare the 
probability of finding the system A in any one of these states with the proba- 
bility that it is found in its lowest state r = 0. 


3.6 Pressure exerted by an ideal gas (quantum mechanical calculation) 
Consider a single particle, of mass m, confined within a box of edge 
lengths L,, Ly, Lz. Suppose that this particle is in a particular quantum state r 
specified by particular values of the three quantum numbers nz, ny, Nz The 
energy E, of this state is then given by (15). 
When the particle is in the particular state r, it exerts on the right wall of the 
box (ie., the wall x = L,) some force F, in the x direction. This wall-must then 
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exert on the particle a force —F, (ie., in the —x direction). If the right wall of 
the box is slowly moved to the right by an amount dL,, the work performed on 
the particle in this state is thus —F,dL, and must be equal to the increase in 
energy dE, of the particle in this state. Thus one has 


dE, = —F, dL;. () 


The force Ἐν exerted by a particle in the state r is thus related to the energy E, 
of the particle in this state by 


R= -, ti) 
Here we have written a partial derivative since the dimensions Ly and L, are 
supposed to remain constant in deriving the expression (ii). 

(a) Using (ii) and the expression (15) for the energy, calculate the force F exerted 
by the particle on the right wall when the particle is in a state specified by given 
values of nz, ny, and nz. 

(b) Suppose that the particle is not isolated, but is one of the many particles 
which constitute a gas confined within the container. The particle, being able 
to interact weakly with the other particles, can then be in any one of many pos- 
sible states characterized by different values of n;, ny, and nz. Express the mean 
force F exerted by the particle in terms of n,?. For simplicity, assume that the 
box is cubic so that L; = Ly = L, = L; the symmetry of the situation then im- 
plies that n2 = n,? *, Use this result to relate F to the mean energy E of 
the particle. 

(c) If there are N similar particles in the gas, the mean force exerted by all of 
them is simply NF. Hence show that the mean pressure Ὁ of the gas (i.e., the 
mean force exerted by the gas per unit area of the wall) is simply given by 


2Np (iii) 


where E is the mean energy of one particle in the gas. 
(d) Note that the result (iii) agrees with that derived in (1.21) on the basis of 
approximate arguments using classical mechanics. 


3.7 Typical number of states accessible to a gas molecule 

The result (iii) of the preceding problem, or Eq. (1.21), permits one to 
estimate the mean energy of a gas molecule, such as nitrogen (N2), at room tem- 
perature. Using the density and the known pressure exerted by such a gas, the 
mean energy E of such a molecule was found in (1.28) to be about 6 x 10-14 erg. 

(a) Use (31) to calculate numerically the number of states &(E), with energy 
less than E, accessible to such a molecule enclosed in a box having a volume of 
one liter (103 cm3), 

(Ὁ) Consider a small energy interval δὲ = 10-24 erg, which is very much 
smaller than E itself. Calculate the number of states Q(E) accessible to the 
molecule in the range between E and E + δὲ. 

(c) Show that the preceding number of states is very large, despite the small 
magnitude of the energy interval 5E. 
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3.8 Number of states of an ideal gas 

Consider an ideal gas consisting of N particles confined within a box with 
edge lengths L,, L,, and L,. Here N is supposed to be of the order of Avogadro's 
number. By considering the energy contribution from each quantum number 
separately and using approximate arguments similar to those used in Sec. 3.5, 
show that the number of states 2(E) in a given energy interval between E and 
E + δῈ is given by 


Q(E) = CVNES2N δὲ 


where C is a constant of proportionality and V = L,L,L; is the volume of the 
box. 


*3.9 Number of states of a spin system 

A system consists of N spins ὁ, each having magnetic moment pio, and 
is located in an applied magnetic field B. The system is of macroscopic size so 
that N is of the order of Avogadro's number. The energy of the system is then 
equal to 


E= --ἰὦ -- πἼμοΒ 


if n denotes the number of its magnetic moments which point up, and π΄ 
the number which point down. 

(a) Calculate for this spin system the number of states 2(E) which lie in a small 
energy interval between Ε and E + δὲ. Here δὲ is understood to be large com- 
pared to individual spin energies, i.e., δὲ 3» poB. 

(b) Find an explicit expression for In Ὦ as a function of Ε. Since both n and 
n’ are very large, apply the result In τὶ ~ n In n — n derived in (M.10) to calcu- 
Jate both n! and πΊ. Show thus that, to excellent approximation, 


In QE) = Nin QN) — HN — E)In(N — ΕἼ -- ἘΝ + E)In(N + ΕἼ 


N-n 


ν i 

where e= rae 

(c) Make a rough sketch showing the behavior of In Q as a function of E. 
Note that {(£) does not always increase as a function of E. The reason is that 
a system of spins is anomalous in that it has not only a lowest possible energy 
E = —NyoB, but also ἃ highest possible energy E = Νμοβ. On the other hand, 
in all ordinary systems where one does not ignore the kinetic energy of the par- 
ticles (as we did in discussing the spins), there is no upper bound on the magni- 
tude of the kinetic energy of the system. 
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Chapter 4 


Thermal Interaction 


- The preceding chapter has provided us with all the postulates and 


essential theoretical framework necessary for a quantitative discussion 


. of macroscopic systems. We are, therefore, ready to explore the power 


of our concepts by applying them to some problems of major physical 
importance. 

We shall begin by examining in detail the thermal interaction be- 
tween systems. The analysis of this situation is particularly simple 


* since the external parameters, and hence also energy levels, of the 


Riv 


Leal) Sata 


systems remain fixed. In addition, thermal interaction is one of the 
processes occurring most commonly in the world around us. The par- 
ticular questions which we shall want to investigate are the following: 
What conditions must be satisfied so that two systems in thermal inter- 
action with each other are in equilibrium? What happens when these 
conditions are not satisfied? What probability statements can be 
made? We shall see that these questions can be answered quite simply 
to yield results of remarkable generality and great utility. Indeed, in 
this chapter we shall attain a good understanding of the concept of 
temperature and a precise definition of the “absolute temperature.” 
Furthermore, we shall obtain some eminently practical methods for 
calculating the properties of any macroscopic system in equilibrium 
on the basis of our knowledge about the atoms or molecules of which 
it is composed. Finally, we shall apply these methods to deduce ex- 
plicitly the macroscopic properties of some specific systems. 


4,1 Distribution of Energy between Macroscopic Systems 
Consider two macroscopic systems A and A’. We shall denote the 
energies of these systems by E and Ε΄, respectively. To facilitate the 


« counting of states, we proceed as in Sec. 3.5 and imagine the energy 


scales to be subdivided into very small equal intervals of fixed size 5E. 
(The magnitude of 5E is, however, supposed to be large enough to 
contain many states.) We shall then denote by Q(E£) the number of 
states accessible to A when its energy lies between E and E + 6E, and 
by 9.ΕἸ the numbef of states accessible to A’ when its energy lies 
between E’ and E’ + δὲ. The counting problem can be simplified 
since we can, to excellent approximation, tréat all energies as if they 
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could assume only discrete values separated by the small amount δὲ. 
In particular, we can lump together all states of A which have an en- 
ergy in the small interval between E and E + δὲ and treat them as if 
they simply had an energy equal to E; there are thus 2(E) such states. 
Similarly, we can lump together all states of A’ which have an energy 
in the small interval between E’ and E’ + δῈ and treat them as if they 
simply had an energy equal to Ε΄; there are thus Ὡ (ΕἼ such states. If 
τ we adopt this procedure, the statement that A has an energy E means Le \ystombontous  Sthrache of 


physically that A has an energy anywhere between E and E + δὲ. Ma So thtan aS .a ὠνὰς. 
Similarly, the statement that A’ has an energy Ε΄ means physically that — du ἀν 480 (GE) Sra ngmated 
A’ has an energy anywhere between Ε΄ and E’ + δὲ. Dif hertants in Ae wale Syste 


The systems A and A’ have fixed external parameters, but are as- _ (xg Evowsaaties ta all Gonstuants 
sumed to be free to exchange energy. (Any énergy transfer between « |. \9¢ Syste 
them is thus, by definition, in the form of heat.) Although the energy 
of each system separately is then not constant, the combined system 
A*=A + A’ is isolated so that its total energy E* must remain un- - 
changed. Hencet 


E + E' = E* = constant. (1) 


When the energy of A is known to be equal to E, the energy of A’ is 
then determined to be 


Ε' -- Ε' -- Εἰ. (2) Vs eae 


Rous yw. 


Now consider the situation where A and A’ are in equilibrium with hy Maoh Vhs 
each other, i.e., where the combined system A’ is in equilibrium. The Υ oy 
energy of A can then assume many possible values. The question of sh ὃ us 
interest is, however, the following: What is the probability P(E) that aoe hae τὶ 
the energy of A is equal to E (ie., that it lies in the interval between wer ade 
E and E + δ) where E has any specified value? [The energy of A’, Ἢ ἜΣ ΜΗΔ a 
of course, then has a corresponding value E’ given by (2).] The an- ~ Ton TN sepa. ἃ 
swer to this question is readily obtained by focusing attention on the - wie ob τὰ 
combined isolated system A*, since the basic postulate (3.19) asserts 
that such a system is equally likely to be found in each one of its acces- 
sible states. We have merely to pose the following question: Among 
the total number 2%; of states accessible to A*, what is the number 
Q*(E) of states of A* which are such that the subsystem A has an energy 


fn our discussion E denotes the energy of A irrespective of A’, and E’ denotes the 
energy of A’ irrespective of A. The total energy E* written as the simple sum (1) neglects, 


_ therefore, any interaction energy Εἰ which depends on both A and A’, ie., any work nec. 


‘essary to bring the systems together. By definition, thermal interaction is supposed to be 
sufficiently weak so that Εἰ is negligible, i.e., so that Εἰ «ζ΄ Ε and Εἰς Ε΄. 
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— ὡς gre measured in terms of an arbi- 
Τα Τα only a few values of (E) and QE’) 
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equal to E? By the general argument leading to (3.20), the desired 
probability P(E) then is simply given by 


- © δ" 
PE) Ξε τος, = CH) 8) 
where C = (Ὡ "οὴ Ὁ is merely some constant independent of E. 

The number 2*(E) can be readily expressed in terms of the numbers 


of states accessible to A and A’, respectively. When A has an energy. 


E, it can be in any one of its Q(E) possible states. The system A’ then 
must have an energy E’ given by (2) by virtue of the conservation of 
energy. Hence A’ can be in any one of the 2/(E’) = 9’(E* — E) states 
accessible to it under these conditions. Since every possible state of 
A can be combined with every possible state of A’ to give a different 
possible state of the total system A*, it follows that the number of dis- 
tinct states accessible to A* when A has an energy E is simply given 
by the product 

Q*(E) = QE) 5." — E). (4) 


Correspondingly the probability (3) that the system A has an energy 
E is simply given by 


P(E) = CME) 5." — Ε). (5) 


Example 


The following simple example uses very 
small numbers unrepresentative of real 
macroscopic systems, but serves to illus- 
trate the essential ideas of the preceding 
paragraphs. Consider two special sys- 
tems A and A’ for which Q(E) and 9.7 
depend on their respective energies E 
and E’ in the manner shown in Fig. 4.1. 
Here the energies E and E’ are measured 
in terms of some arbitrary unit and are 
subdivided into unit energy intervals. 
Suppose that the combined energy E* of 
both systems is equal to,13 units. A 
possible situation would one where ™ 
E = 3; it then follows that Ε' -- 10. In 


this case A can be in either one of its 2 
possible states, and A’ in any one of its 
40 possible states. There are then a 
total of Ὡ" = 2 x 40 = 80 different pos- 
sible states accessible to the combined 
system A*. Table 4.1 enumerates sys- 
tematically the possible situations com- 
patible with the specified total energy E*. 
Note that it would be most probable in a 
statistical ensemble of such systems to 
find the combined system A* in a state 
where E= 5 and E’ = 8. This situation 
would be likely to occur twice as fre- 
quently as the situation where E = 3 and 
F=10. 


Let us now investigate the dependence of P(E) on the energy E. 
Since A and A’ are both systems with very many degrees of freedom, 


we know by (3.37) that 2(E) and ’(E) are extremely rapidly increas- "- 
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ing functions of E and E’, respectively. Considering the expression 
(5) as a function of increasing energy E, it thus follows that the factor 
Q(E) increases extremely rapidly while the factor 2/(E* — E) decreases 
extremely rapidly. The result is that the product of these two factors, 
ie., the probability P(E), exhibits a very sharp maximum} for some 
particular value E of the energy‘E. Thus the dependence of P(E) on 
E must show the general behavior illustrated in Fig. 4.2 where the 
width AE of the region where P(E) has appreciable magnitude is such 
that AE< E. 

It is actually more convenient to investigate the behavior of In P(E) 
rather than that of P(E) itself since the logarithm is a much more slowly 
varying function of E. In addition, it follows by (5) that this logarithm 
involves the numbers 2 and ’ as a simple sum rather than as a prod- 
uct, ie, 


In P(E) = nC + nQ(E) + MQ(E) 6 


where E’= E*—E. The value E = E which corresponds to the 
maximum of In P(E) is determined by the conditiont 


dlnP _1 OP _ 
Ἔ Pee @ 


and thus corresponds also to the maximum of P(E) itself. By using 
(6) and (2), the condition (7) becomes simply ς 


81η Ὡ(Ε), ane’) 
8Ε aE” 


or BE) = BE) (8) 


where we have introduced the definition 


+2 (-1)=0 


pe) =2R8 1 δ ©) 


and the corresponding definition for B'(E’). The relation (8) is thus 
the fundamental condition which determines the particular valu 
of the energy of A (and corresponding value E’ = E* — E of the en- 
ergy of A’) which occurs with the greatest probability P(E). 


+ Note that the behavior of P(E) is analogous to that of the preceding simple example, 
except for the fact that the maximum of P(E) is enormously sharper for macroscopic sys- 
tems where 9(E) and 9'(E’) are such rapidly varying functions. 

{ We write this as a partial derivative to emphasize that all external parameters of the 
system are supposed to remain fixed throughout the discussion. 


Table 4.1 Enumeration of the possible num- 
bers of states compatible with a specified total 
energy E* = 13 of the systems A and A’ 
described in Fig. 4.1. 


E 


AK AE 
Fig. 4.2 Schematic illustration showing the 
dependence of the probability P(E) on the 
energy E. - 


E E 


Fig. 4.3 Schematic sketch of the dependence 
of In Q(E) and of In 2’(E’) = In Q(E* — E) on 
the energy E. By virtue of (3.38), the energy 
dependence of ἴῃ Ὦ is roughly of the form 
In Q(E) ~ fn E — Eo) + constant. Since the 
curves are thus concave downward, their sum 
(shown in white) exhibits a unique maximum 
at some value E. This gentle maximum of the - 
slowly varying logarithm of P(E), given by (6), 
corresponds then to an extremely sharp maxi- 
mum of P(E) itself. 
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Sharpness of the maximum of P(E) 


By examining the behavior of in PE) near 
its maximum, one can readily estimate 
how rapidly P(E) decreases when E be- 
comes different from E. Indeed, it is 
shown in Appendix A.3 that P(E) becomes 
negligibly small compared to its maxi- 
mum value when E differs from E by 
appreciably more than an amount AE 
whose magnitude is roughly of the order 


Ε 
ΔΕ ---Ξ-. 10) 

VF (10) 
Here f is the number of degrees of free- 
dom of the smaller of the two interacting 
systems and E is assumed to be much 
greater than the lowest (or ground state) 
energy of A. For a typical system con- 
sisting of a mole of atoms, f is of the 


order of Avogadro's number so that 
7-- 10 and 


AE ~ 10-28. (a) 


Hence the probability P(E) has ordinarily 
an exceedingly sharp maximum at the 
value E, becoming negligibly small when 
Ediffers from E by as little as a few parts 
in 1013, Thus the energy of A practically 
never differs appreciably from E; in par- 
ticular, the mean value E of A must then 
also be equal to £, i.e., E = E. Here we 
encounter another instance where the 
relative magnitude of the fluctuations of 
@ quantity about its mean value is ex- 
ceedingly small when one is dealing with 
a system consisting of very many par- 
ticles. 


Some conventional definitions 


- The preceding discussion shows that the quantities In 2 and β of the 


system A (and the corresponding quantities of A’) are of crucial impor- 
tance in the study of thermal interaction. It is, therefore, convenient 
to introduce some alternative symbols and conventional names for 
these quantities, 

Note first that the parameter £ has, according to its definition (9), 
the dimensions of a reciprocal energy. It is thus often useful to express 
β΄" as a multiple of some positive constant k which has the dimensions 
of energy and thus can be expressed in ergs. (This constant is called 
Boltzmann's constant and its magnitude can be chosen once and for 
all in some convenient way.) The parameter B-} can then be written 
in the form 


(12) 


where the quantity T thus defined provides a measure of energy in 
units of the quantity k. This new parameter T is called the absolute 
temperature of the system under, consideration and its magnitude is 
commonly expressed irf terms of degrees.t The physical reason for 
the name “temperature” will become more apparent in Sec. 4.3. 


} For example, an absolute temperature 
of 5k. 


of 5 degrees corresponds merely to an energy 


“ 
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By virtue of (9), the definition of T in terms of InQ can now be 
written in the form 
= 05: (13) 
where we have introduced the quantity S defined by 


(14) 


This quantity S is called the entropy of the system under consideration. 
It has the dimensions of energy because its definition involves the con- 
stant k. According to its definition (14), the entropy of a system is 
merely a logarithmic measure of the number of states accessible to the 
system. In accordance with the comments at the end of Sec. 3.6, the 
entropy thus provides a quantitative measure of the degree of random- 
ness of the system.t 

In terms of the previous definitions, the condition that the proba- 
bility Ρ(Ε) be maximum is, by virtue of (3), equivalent to the statement 
that the entropy S* = kIn Ὧ" of the total system is maximum with 
respect to the energy E of the subsystem A, Using (6), the condition 
of maximum probability is thus equivalent to the statement that 


S =S +S’ = maximum. (15) 
This condition is fulfilled if (8) is satisfied, i.e., if 
T=T. (16) 


Our discussion thus shows that the energy E of A adjusts itself in such 
a way that the entropy of the total isolated system A* is as large as 
possible. The system A* is then distributed over the largest possible 
number of states, i.e., it is in its most random macrostate. 


4.2 The Approach to Thermal Equilibrium 

As we have seen, the probability P(E) has an exceedingly sharp maxi- 
mum at the energy E= E. Therefore, in the equilibrium situation 
where A and A’ are in thermal contact, the system A almost always has 
an energy E extremely close to E, while the system A’ correspondingly 
has an energy E’ extremely close to E’ = E* — E. To excellent ap- 


# Note that the entropy defined by (14) has a definite value which is, by virtue of (3.40), 
essentially independent of the magnitude of the energy subdivision interval δ used in 
our discussion. Furthermore, since δῈ is some fixed interval independent of E, the deriva- 
tive (9) defining f or T is certainly also independent of 8E. 


ivr 
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proximation, the respective mean energies of the systems are then also 
equal to these energies, i.e., 


E=E and Ε’-Ε'. (17) 


Now consider a situation where the systems A and A’ are initially 
isolated from each other and separately in equilibrium, their mean 
energies being E; and E;’, respectively. Suppose that A and A’ are 
then placed in thermal contact so that they are free to exchange energy 
with each other. The resulting situation is ordinarily an extremely 
improbable one, except in the special case where the respective ener- 
gies of the systems were initially very close to Eand Ε΄. Inaccordance 
with our postulate (3.18), the systems will then tend to exchange energy 
until they attain the ultimate equilibrium situation discussed in the 
preceding section, The final mean energies E; and E; of the systems 
will then, by (17), be equal to 


E,=E and E/=EF' (18) 
so that the probability P(E) becomes maximum. The B parameters of 
the systems will then correspondingly be equal, i.e., 

By = Bf (19) 
where B=B(E) and B/=B(Es). 


The conclusion that the systems exchange energy until they reach 
the situation of maximum probability P(E) is, by virtue of (6) and the 
definition (14), equivalent to the statement that they exchange energy 
until their total entropy becomes maximum. Thus the final proba- 
bility (or entropy) can never be less than the original one, i.e., 


S(E;) + S(E/) > S(Ei) + 5.5 


or AS + AS’ > 0, (20) 


where AS= S(E,) — S(Ei) 
and AS’ = S'(E/) — S(Ei’) 
denote the entropy changes of A and A’, respectively. 
In the process of exthanging energy, the total energy of the systems 


is, of course, conserved. In accordance with (3.49) and (3.50), it thus 
follows that 


(21) 


where Q and Q’ denote the heats absorbed by A and A’ respectively. 
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The relations (20) and (21) summarize completely the cor which 
must be satisfied in any process of thermal interaction. 
Our discussion shows that there are two possible cases which may 


arise: 


(i) The initial energies of the systems are such that 8; = β΄, where 
8; = βί(Εὴ and B;’ = A(Ei’). The systems are then already in their 
most probable situation, i.e., their total entropy is already maximum. 
Hence the systems remain in equilibrium and no net exchange of heat 
occurs between them. 


(ii) More generally, the initial energies of the systems are such that 
8: ~ βι΄. The systems are then in a rather improbable situation where 
their total entropy is not maximum. Hence the situation will change 
in time while energy, in the form of heat, is exchanged between the 
systems until the final equilibrium is reached where the total entropy 
is maximum and β, = /. 


4.3 Temperature 
In the last section, we noted that the parameter [or equivalently the 
parameter T = (kB)~*] has the following two properties: 


(i) If two systems separately in equilibrium are characterized by the 
same value of the parameter, then equilibrium will be preserved and 
no heat transfer will occur when the systems are brought into thermal 
contact with each other. 


(ii) If the systems are characterized by different values of the param- 
eter, then equilibrium will not be preserved and heat transfer will 
occur when the systems are brought into thermal contact with each 
other. 


These statements allow us to derive several important consequences. 


| In particular, they permit us to formulate in a precise and quantitative 


way the qualitative notions explored in Sec. 1.5. 

Imagine, for example, that there are three systems A, B, and C which 
are separately in equilibrium. Suppose that no heat transfer occurs 
when C is placed in thermal contact with A, and that no heat transfer 
occurs when C is placed in thermal contact with B. Then we know that 
Bc = Ba and that Bc’ = Bx (where B,, Bz, and Bc denote the 8 param- 
eters of systems A, B, and C, respectively). But from these two equali- 
ties we can conclude that 84 = Bz, and hence that no heat transfer will 
occur if systems A and B are placed in thermal contact with each other. 
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Fig. 4.4 Constant-volume and constant-pres- 
sure gas thermometers. 
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Thus we are led to the following general conclusion: 


If two systems are in thermal equilibrium with a third 
system, then they must be in thermal equilibrium with 
each other. 


(22) 


The statement (22) is called the zeroth law of thermodynamics. Its 
validity makes possible the use of test systems, called thermometers, 
which allow measurements to decide whether any two systems will, or 
will not, exchange heat when brought into thermal contact with each 
other. Such a thermometer can be any macroscopic system M chosen 
in accordance with the following two specifications: 

(a) Among the many macroscopic parameters characterizing the 
system M, select one (call it @) which varies by appreciable amounts 
when M gains or loses energy by thermal interaction. All the other 
macroscopic parameters of M are held fixed. The parameter @ which 
is allowed to vary is called the thermometric parameter of M. 

(b) The system M is ordinarily chosen to be much smaller (i.e., to 
have many fewer degrees of freedom) than the systems which it is 
designed to test. This is desirable in order to minimize any energy 
transfer to these systems and thus to reduce to a minimum their dis- 
turbance by the testing process. 3 


Examples of thermometers 


There are many possible systems which thermometer. Here the thermometric 


can serve as thermometers. We men- 
tion only a few of those which are com- 
monly used. 

(i) A liquid, such as mercury or alco- 
hol, confined within a glass tube of small 
diameter. This is the familiar type of 
thermometer which we have already de- 
scribed in Sec. 1.5. Here the thermo- 
metric parameter @ is the height of the 
liquid in the tube. 

(ii) A gas confined within 8 bulb in 


such a way that its volumeis kept con- _ 
-volunte 


stant. This is called a comstant- 
gas thermometer. Here the thermometric 
parameter @ is the pressure exerted by 
the gas. 

(iii) A gas confined within a bulb in 
such a way that its pressure is kept con- 
stant. This is called a constant-pressure 


gas 
parameter @ is the volume occupied by 
the gas. 

(iv) An electrical conductor (e.g., 2 
coil of platinum wire) maintained at 
constant pressure and carrying a small 
current. This is called a resistance ther- 
‘mometer. Here the thermometric param- 
eter 6 is the electrical resistance of the 
conductor. 

(v) A paramagnetic sample main- 
tained at constant pressure. Here the 
thermometric parameter @ is the mag- 
netic susceptibility of the sample (i.e., 
the ratio of its mean magnetic moment 
per unit volume to the applied magnetic 
field.) This quantity can be determined, 
for example, by measuring the self-in- 
ductance of a coil wound around the 
sample. 


+ 
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A thermometer M is used in the following way. It is successively 
placed in thermal contact with the systems under test, call them A and 
B, and is allowed to come to equilibrium with each. 

(i) Suppose that the thermometric parameter θ of M (e.g., the length 
of the liquid column of a mercury-in-glass thermometer) assumes the 
same value in both cases. This implies that, after M has come to 
equilibrium with A, it remains in equilibrium after being placed in 
thermal contact with B. Hence the zeroth law allows one to conclude 
that A and B will remain in equilibrium if they are brought into thermal 
contact with each other. 

(ii) Suppose that the thermometric parameter @ of M does not 
assume the same value in both cases. Then one can conclude that A 
and B will not remain in equilibrium if they are brought into thermal 
contact with each other. To make the argument explicit, suppose that 
A and B did remain in equilibrium. After M attains thermal equilib- 
rium with A, it would then, according to the zeroth law, have to re- 
main in equilibrium when brought into thermal contact with B. But 
then the parameter 6 cannot change if M is brought into thermal con- 
tact with B, contrary to hypothesis.+ 

Consider any thermometer M with any one parameter 8 chosen as 
its thermometric parameter. The value assumed by @ when the ther- 
mometer M has come to thermal equilibrium with some system A will, 
by definition, be called the temperature of A with respect to the partic- 
ular thermometric parameter 6 of the particular thermometer M. Ac- 
cording to this definition the temperature can be a length, a pressure, 
or any other quantity. Note that, even if two different thermometers 
have parameters of the same kind, they do not ordinarily yield the 
same value of the temperature for the same body.{ Furthermore, if a 
body C has a temperature halfway between the temperatures of bodies 
A and B when measured by one thermometer, this statement is not 
necessarily true with respect to the temperatures measured by some 
other thermometer. Nevertheless, our discussion shows that the tem- 


+ All the preceding measurements could be performed with any other thermometer M’ 
having a thermometric parameter θ΄. There exists ordinarily a one-to-one relation between 
any value of @ and the corresponding value of 6’. In exceptional cases-a particular ther- 
mometer M might, however, be multiple-valued so that a given value of @ corresponds to 
more than one value of θ΄ for almost every other thermometer Μ΄. Peculiar thermometers 
which are multiple-valued in the experimental range of interest are rarely useful and will 
be excluded from our discussion. (See Prob. 4.1.) 

1 For example, the two thermometers might both consist of glass tubes filled with liquid, 
so that the length of the liquid column is the thermometric parameter in both cases. The 
liquid might, however, be mercury in one thermometer and alcohol in the other. 
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perature concept which we have defined has the following useful 
property: 


Two systems will remain in equilibrium when placed in 
thermal contact with each other if, and only if, they (23) 
have the same temperature with respect to the same 
thermometer. 


The temperature concept which we have introduced is important 
and useful, but is rather arbitrary in the sense that the temperature 
assigned to a system depends in an essential way on the peculiar prop- 
erties of the particular system M used as the thermometer. On the 
other hand, we can exploit the properties of the parameter β to obtain 
ἃ much more useful temperature. Indeed, suppose that we have a 
thermometer M whose parameter β is known as a function of its ther- 
mometric parameter 9. If this thermometer is placed in thermal con- 
tact with some system A, then we know that in equilibrium β = βα. 
Thus the thermometer measures, by virtue of (9), a fundamental prop- 
erty of the system A, namely the fractional increase of the number of 
its states with energy. Furthermore, suppose that we take any other 
thermometer M’ whose parameter β' is also known as a function of its 
thermometric parameter 6’. If this thermometer is placed in thermal 
contact with the system A, then we know that in equilibrium β΄ = Ba. 
Thus A’ = β and we arrive at the following conclusion: 


Ὑ «ΤῈ the parameter β is used as the thermometric parame- 
ter of a thermometer, then ΕΞ thermometer 
yields the same temperature Treading when used to 
measure the temperature of a particular, system, Fur- 

—|,thermore, this temperature measures a fundamental 
property of the number of states of the, system under test. 


The parameter β is, therefore, a particularly useful and fundamental 
temperature parameter. This is the reason for the name of absolute 
temperature given tg the cogresponding temperature parameter 
T= (Κβγ-1 defined in terms of 8. We shall postpone until the next 
chapter a discussion of the following two points: (i) practical proce- 
dures for finding numerical values of 8 or T by appropriate measure- 
ments, and (ii) the international convention adopted to assign a par- 
ticular numerical value to k. 
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Properties of the absolute temperature. 
By virtue of its definition (9), the absolute temperature is given by 
1 —~p— 22 
3 te, OE,” 69 


We saw in (3.37) that Q(E) is, for any ordinary system, an extremely 
rapidly increasing function of its energy E. Hence (25) implies that, 


for any ordinary system, 9» ar TS 0: (26) 
In other words, 


the absolute temperature of any 
ordinary system is positive.t 


We can also readily estimate the order of magnitude of the absolute 
temperature of a system. The approximate functional dependence of 
Q(E) is ordinarily of the form given by (3.38), 


Q(E) x (E — Eo)! (28) 


where fis the number of degrees of freedom of the system and E is the 
energy of the system whose ground state energy is Eo. Thus 


In@ ~ fin (E — Eo) + constant 


(27) 


so that 
(29) 


The magnitude of T can then be estimated by putting E = E, the 
mean energy of the system. Hence we can conclude that, 


for an ordinary system, r= 5~ 25%. (30) 


In other words, 


for any ordinary system at the absolute temperature T, 
the quantity kT is roughly equal to the mean energy 
(above the ground state) per degree of freedom of the 
system. 


(31) 


# As pointed out in connection with (3.38), the qualifying phrase for any ordinary system 
is meant to exclude specifically the exceptional case of a system where the kinetic energy 
of the particles is ignored and where their spins have a magnetic energy which is sufficiently 
high. 


Fig. 4.5 Schematic sketch showing the be- 
havior of In Ὦ as a function of the energy E. 
The slope of the curve is the absolute temper- 
ature parameter β. 
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The condition of equilibrium (8) between two systems in thermal 
contact asserts that their respective absolute temperatures must be 
+ equal. By virtue of (31) we see that this condition is roughly equiva- 
lent to the statement that the total energy of the interacting systems 
is shared between them in such a way that the mean energy per degree i 
i of freedom is the same for both systems. This last statement was 
essentially the one which we used in our qualitative discussion of 
Sec. 1.5. 
Τ How does the parameter β, or T, vary with the energy E of ἃ sys- 
tem? The quantity β measures the slope of the curve of In & versus E. 
We already noted in the caption of Fig. 4.3 that this curve must be 
concave downward so as to guarantee the physical requirement that a 
situation with a unique maximum probability results when two sys- 
tems are placed in thermal contact with each other. Hence it follows 
that the slope of the curve must decrease as E increases; i.e., 


op 
f we 
for any system, 3 <0. (32) 


τ In the case of an ordinary system, this result follows also from the ap- 
proximate functional form (28) since differentiation of (29) yields 
explicitly 


ap f 
oe See (38) 


Since we have just shown that £ decreases as E increases and since, 
by its definition T= (k8)1, the absolute temperature T increases as 
B decreases, we can then conclude from (32) that 


the absolute temperature of any system (34) 
is an increasing function of its energy. 


In more mathematical terms, 


ar _ 
GE GE 


so that (32) implies di 
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This last relation permits us to establish a general connection be- 
tween absolute temperatures and the direction of heat flow. Consider 
two systems A and A’ which are initially in equilibrium at different 
absolute temperatures T; and T;’, and which are then brought into 
thermal contact with each other. One system then absorbs heat, and 
the other gives off heat, until the systems attain a final equilibrium at 
some common absolute temperature Τὴ. Suppose that system A is the 
one which absorbs heat and thus gains energy; then it follows by (34) 
that T; > T;. Correspondingly system A’ must give off heat and thus 
lose energy; then it follows by (34) that T;< T;’. Hence the initial 
and final temperatures are such that 


ΤΙ « Τί « Ty. 


This means that the system A which absorbs heat has an initial abso- 
lute temperature T; which is less than the initial absolute temperature 
Tj’ of the system A’ which gives off heat. In short, 


when any two ordinary systems are placed in thermal 
contact with each other, heat is given off by the system 
with the larger absolute temperature and is absorbed by 
the system with the smaller absolute temperature.t 


(36) 


Since we defined the warmer system as the one which gives off heat 
and the colder system as the one which absorbs heat, (36) is equivalent 
to the statement that a warmer system has a higher absolute tempera- 
ture than a colder one. 


4.4 Small Heat Transfer 

The preceding sections complete our general discussion of the thermal 

interaction between macroscopic systems. We shall now turn our at- 

tention to some special simple cases which are of particular importance. 
Suppose that, when a system A is placed in thermal contact with 

some other system, it absorbs an amount of heat Q which is so small 

that 


I< E—E, ᾿ (37) 


ie., so that the resulting mean energy change AE = Q of A is small 
compared to the mean energy E of A above its ground state. The 


+ In the exceptional case of a system of spins, this statement may need to be qualified 
because T -Ὁ 500 when B + 0. See Prob. 4.30. 
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absolute temperature of the system A changes then by: a negligible 
amount. Indeed, putting E = E, Ὧ9) and (33) yield the estimates 


Fig; Sep aa eg Seek 
.- Sp =350~ τ πε΄ ἘΞ ΕΟ 
Hence (37) implies that ACE SEK) 
ssi =| Bol «8. (38) 


* Since T= (kB) or nT= —In -- Ink, it follows correspondingly 
that (AT/T) = —(AB/f) 80 that (38) is also equivalent to 


|AT| «1. (39) 


** We shall say that the heat Q absorbed by a system is small whenever 
(38) is valid, 1.6., whenever Q is sufficiently small so that the absolute 
temperature of the system remains essentially unchanged. 

ἊΣ Suppose that the system A absorbs such a small amount of heat Q. 
Its initial and final energies are then, with overwhelming probability, 
equal to their respective mean values E and E + Q. In the process 

» of absorbing this heat, the number of states (ΕἸ accessible to A also 
changes. By expanding in a Taylor series, we find 


ina + 9a) = (2gO)o+ HBB) e+ 


= 1Boy... 
=BQ+ gat: 


||| But since the absorbed heat Q is supposed to be small, the absolute 

| temperature of A remains essentially unchanged. Hence the term 
ἡ involving 08/2E can be neglected, in accordance with (38). The 
" change in the quantity In 2 then becomes simply 


ἜΣ: ἘΞ ρ-βρ. (40) 


In the process of absorbing an amount of heat Q, the entropy 
S = kin Q ofa system at the absolute temperature T = (kf) changes 
thus by an amount Saanich is,, 


a 
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We emphasize that, even if the heat Q is large in absolute magni- 
tude, it may yet be relatively small in the comparative sense of (37) 
or (39) so that the relation (41) remains valid. If the absorbed heat is 
actually infinitesimal in magnitude, we can denote it by dQ; the cor- 
responding infinitesimal entropy change is then equal to 


Note that the heat dQ is merely an infinitesimal quantity. The quan- 
tity dS is, however, an actual differential, i.e., an infinitesimal difference 
between the entropies of A in its final and initial macrostates. 

The heat Q absorbed by a system A will always be very small in the 
comparative sense of (37) or (39) when A is placed in thermal con- 
tact with any other system B sufficiently smaller than A itself. Indeed, 
any amount of heat Q which A can absorb from B then is at most of 
the order of the total energy of B (above its ground state) and is thus 
much smaller than the energy E — Ep of A itself. A system A is said 
to act as a heat reservoir (or heat bath) with respect to some other set 
of systems if it is sufficiently large so that its temperature remains essen- 
tially unchanged in any thermal interaction with those systems. Equa- 


tion (41) thus is always valid in relating the entropy change AS of ἃ. 


heat reservoir to the heat Q absorbed by it. 


4.5 System in Contact with a Heat Reservoir 
Most systems which one encounters in practice are not isolated, but 
are free to exchange heat with their environment. Since such a system 
is usually small compared to its environment, it constitutes a relatively 
small system in thermal contact with a heat reservoir provided by other 
systems in its environment. (For example, any object in a room, say 
a table, is in thermal contact with the heat reservoir consisting of the 
room itself with its floor, walls, other furniture, and enclosed air.) In 
this section we shall, therefore, consider any relatively small system A 
in contact with a heat reservoir A’, and ask the following detailed ques- 
tion about the small system A: Under conditions of equilibrium, what 
is the probability P, of finding the system A in any one particular state + 
of energy E,? = 

This is a rather important and very general kind of question. Note 
that in the present context the system A can be any system which has 


many fewer degrees of freedom than the heat reservoir Α΄. Thus A 


ive 


ὡ 


iw 


Ww 
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might be any relatively small macroscopic system. (For example, it 
might be a piece of copper immersed in the water of a lake, the latter 
acting as a heat reservoir.) Alternatively A might also be a distinguish- 
able microscopic system which can be clearly identified.{ (For exam- 
ple, it might be an atom located at a particular lattice site in a solid, 
the latter acting as a heat reservoir.) 

To facilitate counting the states of the reservoir A’, we again imagine 
its energy scale to be subdivided into small fixed intervals of magni- 
tude δῈ and denote by 2’(E’) the number of states accessible to A’ 
when its energy is equal to Ε΄ (ie., when it lies between E’ and 
E’ + 6E). (Here 6E is assumed to be very small compared to the sepa- 
ration between the energy levels of A, but to be large enough to con- 
tain many possible states of the reservoir A’.) It is then quite easy to 
use reasoning similar to that of Sec. 4.1 to find the desired probability 
P, that the system A is in its state r. Although the reservoir can have 
any energy E’, the conservation of energy applied to the isolated sys- 
tem A* consisting of both A and A’ requires that the energy of A* have 
some constant value, say Ε΄. When the system A is in its state r of 
energy E,, the reservoir A’ must then have an energy 


Ε΄ = Ε" --Ε, (43) 


But, when A is in this one definite state r, the number of states acces- 
sible to the combined system A* is simply the number of states 
Q'(E* — E,) which are accessible to A’. Our fundamental statistical 
postulate asserts, however, that the isolated system A* is equally likely 
to be found in each one of its accessible states. Hence the probability 
of occurrence of a situation where A is in the state r is simply propor- 


+ tional to the number of states accessible to A* when A is known to be 


in the state 1, i.e., 


Up to this point our discussion has been completely general. Let 
us now make use of the fact that A is very much smaller than the reser- 
voir A’ in the sense that any energy E, of interest to us satisfies the 


relation 4 


+ 
E,&E*. (45) 


} This qualifying remark is pertinent since it may not always be possible, in a quantum- 
mechanical description, to identify a particular particle among particles which are basically 
indistinguishable. 


"- 


in 


Sec. 4.5 159 


We can then find an excellent approximation for (44) by expanding the 
slowly varying logarithm of 2/(E’) about the value E’ = Ε΄. Analo- 
gously to (40), we thus get for the heat reservoir A’ 


In Q(E* — E,) = nQ(E*) — [2n2' |e, 


dE’ 
= InQ’(E*) — BE, (46) 
Here we have written 
= [26] τις (47) 


for the derivative evaluated at the fixed energy E’= Ε΄. Thus 
Β = (kT)? is simply the constant temperature parameter of the heat 
reservoir A’.{ Hence (46) yields the result 


Tot ΩἿΕ" — B) = Ὡ(ΕἾ er. (48) 


Since 2/(E*) is merely a constant independent of r, (44) then becomes 
simply 


where C is a constant of proportionality independent of r. 

Let us examine the physical content of the results (44) or (49). If 
A is known to be in a definite state τ, the reservoir A’ can be in any 
one of the large number ’(E* — E,) of states accessible to it under 
these conditions. But the number of states 2/(E’) accessible to the 
reservoir is ordinarily a rapidly increasing function of its energy E’ 
[i.e., B in (47) is ordinarily positive]. Suppose then that we compare 


|| the probabilities of finding the system A in any two of its states which 


have different energies. If A is in the state where its energy is higher, 
the conservation of energy for the total system then implies that the 
energy of the reservoir is correspondingly lower; hence the number of 
states accessible to the reservoir is markedly reduced. Accordingly, 
the probability of encountering this situation is very much less. The 
exponential dependence of P, on E, in (49) merely expresses the result 
of this argument in mathematical terms. 


+ For the sake of simplicity, we shall not embellish the symbol £ with a prime. 
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Fig. 4.6 Schematic illustration showing the 
states accessible to a particular system A and 
to a special (rather small) heat reservoir A’. 
The top diagram shows the energy levels cor- 
responding to a few distinct states of A. The 
bottom diagram shows, for a few values of Ε΄, 
the number of states 2’(E’) accessible to A’ as 
a function ofits energy Ε΄. The energy is meas- 
ured in terms of an arbitrary unit. 
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Example 


Let us illustrate the preceding comments 
with a simple example. Consider a sys- 
tem A which has energy levels some of 
which are shown in the top part of Fig. 
4.6. Consider also the much larger sys- 
tem A’ whose energy scale is subdivided 
into intervals of size δὲ = 1 unit and 
whose number of states 2’(E’) is shown 
as a function of its energy E’ in the bot- 
tom part of Fig. 4.6. We assume that A 
is in thermal equilibrium with the heat 
reservoir A’ and that the total energy Ε΄ 
οἵ the combined system A* has some 
value E* = 2050 units. Suppose that A 
is in the particular state r where its en- 
ergy E,= 10 units. The energy of the 
reservoir A’ must then be E’ = 2040; 
accordingly, A’ can be in any one of 


The probability (49) is a very 


2 x 108 possible states. 
ble containing many isolated systems A* 
consisting of A and A’, the number of in- 
stances where A is found in the state r 
will then be proportional to 2 x 106. On 
the other hand, suppose that A is in the 
particular state s where its energy E, = 
16 units. The energy of the reservoir A’ 
must then be E’ = 2034; accordingly, A’ 
can now be in any one of only 105 pos- 
sible states. In the ensemble of sys- 
tems, the number of instances where A 
is found in the state s will thus be pro- 
portional to 105 and will consequently be 
only half as large as the number of in- 
stances where A is in the state r of lower 
energy. 


general result of fundamental impor- 


tance in statistical mechanics. The exponential factor e~* is called 
the Boltzmann factor; the corresponding probability distribution (49) 


is known as the canonical distribution. An ensemble of systems all 
| which are in contact with a heat reservoir of known temperature 
] [1.6., all of which are distributed over their states in accordance’ with 


|| (49)] is called a canonical ensemble. 


+ The constant of propo: 
by the normalization condition 


ionality C in (49) can be readily determined 
that the system must have probability 


unity of being in some one of its states; i.e., 


ΣΡ,Ξ 1 
ξ 


(50) 


where the sum extends over all possible states of A irrespective of 
energy. By virtue of (49) this condition determines C to be such that 


΄ 


ΟἿ 688, = 1. 
τ 


x 
Hence (49) can be written in the explicit form 


(51) 


of | 
τὴ 


In an ensem- - 


| 
| 
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The probability distribution (49) allows us to calculate very simply 
the mean values of various parameters characterizing the system A in 
contact with a heat reservoir at the absolute temperature T = (k8)-. 
For example, let y be any quantity assuming the value y, in state r of 


the system A. Then the mean value of y is given by 


— SP yr = 
τ 


ePEry, 
—_—___ 


Som: (62) 


where the summation is over all states r of the system A. 


Remarks pertinent when A is a macroscopic system 


The fundamental result (49) gives the 
probability P, of finding A in any one par- 
ticular state r of energy E,. The proba- 
bility P(E) that A has an energy ina small 
range, say between E and E + δὲ, is then 
simply obtained by adding the probabili- 
ties for all states r whose energy E, lies 
in the range E< E, < E + 8E;ie., 


PE) =>"P, 


where the prime on the summation sym- 
bol indicates that the sum extends only 
over the states having nearly identical 
energies lying in this small range. But 
the probability P, is then, by (49), essen- 
tially the same for all these states and 
proportional to e~##. Hence the proba- 
bility P(E) of interest is obtained simply 
by multiplying {the probability of finding 
A in any one of these states} by (the 
number Ὡ(Ε) of its states in this energy 
range}; i.e., 


PE) = CME) e-FF. (53) 


To the extent that A itself is a large 
system (although very much smaller 
than A’), Q(B) is a rapidly increasing 
function of E. The presence of the 
rapidly decreasing factor e~£F in (53) re- 
sults then in a maximum of the product 
QE) 6.38. This maximum in P(E) is 
sharper the larger A is, i.e., the more 
rapidly Q(E) increases with E. Thus we 
arrive again at the conclusions obtained 
in Sec. 4.1 for a macroscopic system. 


When a system in contact with a heat 
reservoir is itself of macroscopic size, 


the relative magnitude of the fluctuations 


of its energy E is so exceedingly small 
that its energy is practically always equal 
to its mean value E. On the other hand, 
if the system were removed from contact 
with the heat reservoir and were ther- 
mally isolated, its energy could not fluc- 
tuate at all. The distinction between 
this situation and the previous one, how- 
ever, is so small that it is practically ir- 
relevant; in particular, the mean values 
of all physical parameters of the system 
(such as its mean pressure or mean 
magnetic moment) remain quite unaf- 
fected. Hence it makes no difference 
whether these mean values are calcu- 
lated by considering the macroscopic 
system to be an isolated system with a 
fixed energy in some small range be- 
tween E and E + δὲ, or by considering 
this system to be in thermal contact with 
@ heat reservoir of such a temperature 
that the mean energy E of the system is 
equaltoE. The latter point of view, how- 
ever, makes calculations much easier. 
The reason is that use of the canonical 
distribution reduces the computation of 
mean value to the evaluation of the sums 
(52) over all states without restriction; 
thus it does not require the much more 
difficult task of counting the number Q of 
states of a particular type lying in a spec- 
ified small energy range. 


6E 


Fig. 4.7 Schematic illustration showing the 

dependence of the function 2(E)e~4® on the 

energy E of a macroscopic system in contact 
+» with a heat reservoir. 
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Fig. 48 Ludwig Boltzmann (1844-1906). 
This Austrian physicist, who did pioneering 
work in statistical physics, contributed greatly 
to the development of the atomic theory of 
gases which he put in its modern quantitative 
form. His work in 1872 led to fundamental 
insights into the microscopic explanation of 
irreversibility, He also laid the foundation for 
statistical mechanics and introduced the basic 
relation 8 τε απ Ὁ connecting the entropy 
with the number of accessible states. Boltz- 
mann’s work came under heavy attack by a 
whole school of thought represented by such 
men as Ernst Mach (1838-1916) and Wilhelm 
Ostwald (1853-1932), who argued that physi- 
cal theory should deal only with macroscopi- 
cally observable quantities and should reject 
purely hypothetical concepts such as atoms. 
Discouraged, Boltzmann wrote in 1898, “I am 
conscious of being only an individual struggling 
weakly against the stream of time.” Increas- 
ingly subject to depressions, Boltzmann com- 
mitted suicide in 1906 only shortly before 
Perrin’s experiment on Brownian motion (1908) 
and Millikan’s oil-drop experiment (1909) pro- 
vided very direct evidence for the discrete 
atomic structure of matter. (Photograph by 
courtesy of Professor W. Thirring, University 
of Vienna.) 


“. 


Fig. 4.9 Josiah Willard Gibbs (1839-1903). 
The first American theoretical physicist of 
note, he was born and died in New Haven 
where he spent all his working life as professor 
at Yale University. In the 1870s he made fun- 
damental contributions to thermodynamics, 
couching its purely macroscopic reasoning in 
powerful analytical form and using it to treat 
many important problems of physics and chem- 
istry. Around 1900 he developed a very gen- 
eral formulation of statistical mechanics in 
terms of the concept of ensembles. Despite 
the modifications brought about by quantum 
mechanics, the basic framework of his formu- 
lation still stands and is essentially the one upon 
which we have built our systematic discussion 
beginning with Chap. 3. The name canonical 
ensemble originated with Gibbs. (Photograph 
by courtesy of the Beinecke Rare Book and 
Manuscript Library, Yale University.) 
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4.6 Paramagnetism 


τ The canonical distribution can be used to discuss a large number of 


situations of great physical interest. As a first application we shall in- 
vestigate the magnetic properties of a substance which contains No 
magnetic atoms per unit volume and which is placed in an external 
magnetic field B. We consider the particularly simple case where 
each magnetic atom has spin } and an associated magnetic moment pio. 
In a quantum-mechanical description the magnetic moment of each 
atom can then point either “up” (i.e., parallel to the external field) or 


“down” (i.e., antiparallel to the field). The substance is said to be — 


paramagnetic since its magnetic properties are due to the orientation 
of individual magnetic moments. Suppose that the substance is at an 
absolute temperature T. What then is ji, the mean component of the 
magnetic moment of one of its atoms along the direction of the 
magnetic field ΒΡ 

We assume that each magnetic atom interacts only weakly with all 
the other atoms of the substance. In particular, we assume that the 
magnetic atoms are sufficiently far separated from each other so that 
we can neglect the magnetic field produced at the position of one 
magnetic atom by a neighboring magnetic atom. It is then permissible 
to focus attention on a single magnetic atom as the small system under 
consideration and to regard all the other atoms of the substance as 
constituting a heat reservoir at the absolute temperature T of interest.} 

Each atom can be in two possible states: the state (+) where its 
magnetic moment points up, and the state (—) where its magnetic 
moment points down. Let us discuss these states in turn. 

In the state (+), the atomic magnetic moment is parallel to the field 
so that 1 = po. The corresponding magnetic energy of the atom is 
then ες = —poB. The canoncial distribution (49) thus yields for 
the probability P, of finding the atom in this state the result 

P, = (6:8: = CefuoB (54) 


where C is a constant of proportionality and 8 = (kT)-1. This is the 
state of lower energy and thus is the state in which the atom is more 
likely to be found. 


+ This assumes that it is possible to identify a single atom unambiguously, an assump- 
tion which is justified if the atoms are localized at definite lattice sites of a solid or if they 
form a dilute gas where the atoms are widely separated. In a sufficiently concentrated gas 
of identical atoms the assumption breaks down since the atoms are then indistinguishable 
in ἃ quantum-mechanical description. It would then be necessary to adopt a point of view 
(which is always permissible, although more complicated) which considers the entire gas 
of atoms as a small macroscopic system in contact with some heat reservoir. 


Fig. 4.10 An atom with spin } shown in ther- 
mal contact with a heat reservoir A’. When 
the magnetic moment of the atom points up, its 
energy is less by an amount 2y19B than when it 
points down; accordingly the energy of the 
reservoir is greater by an amount 2yoB so that 
the reservoir can be in many more states. 
Hence the situation in which the moment 
points up occurs with greater probability than 
that in which it points down. 


BoB 
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Fig. 4.11 Graph showing the probability P, 
that a magnetic moment pio points parallel (and 
the probability P_ that it points antiparallel) to 
an applied magnetic field B when the absolute 
temperature is T. 


= 
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In the state (—) the atomic magnetic moment is antiparallel to the 
field so that μ = —po. The corresponding energy of the atom is then 
«= +poB. The probability P_ of finding the atom in this state is thus 


P_ = σε 8- = (ε βμοβ, (55) 


This is the state of higher energy and thus is the state in which the atom 
is less likely to be found. 

The constant C is immediately determined by the normalization 
requirement that the probability of finding the atom in either one of 
these states must be unity. Thus 


P, + P_ = C(efnoB + e-PuoB) = 1 


= 1 
= cS efuoB 4 e—Bu0B * (56) 


Since the atom is more likely to be in the state (+) where its mag- 
netic moment is parallel to the field B, the mean magnetic moment μα 
must point in the direction of the field B. By virtue of (54) and (55), 
the significant parameter characterizing the orientation of the magnetic 
moment is the quantity 


= μοΒ ἊΣ 
τὸ = BuoB = VT (57) 
which measures the ratio of the magnetic energy 1B to the character- 
istic thermal energy kT. It is apparent that if T is very large (ie., if 
w<1), the probability that the magnetic moment is parallel to the 
field is almost the same as that it is antiparallel. In this case the mag- 
netic moment is almost completely randomly oriented so that 2 = 0. 
On the other hand, if T is very small (ie., if w> 1), it is much more 
probable that the magnetic moment is parallel to the field than anti- 
parallel to it. In this case | = μο. 
All these qualitative conclusions can readily be made quantitative 
by actually calculating, the mean value fi. Thus we find 


: 
a BuoB — e—BuoB “κι 
ΞΞΡΗμο) + Ρ. (-- μο) = bo eB 5: ἘΞ “ἀπ᾿ (58) 


Alternatively, this result can be written in the form 


(59) 
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where we have used the definition of the hyperbolic tangent 


=O eS 
tanh w = poaee (60) 
The mean magnetic moment per unit volume of the substance (1.6., its 
magnetization) points then in the direction of the magnetic field. Its 
magnitude Mo is simply given by 


Mo = Now (61) 


if there are No magnetic atoms per unit volume. 

We can easily verify that ji exhibits the behavior discussed previously 
in qualitative terms. If w<1, then ev =1+w+--- and 
e~=1—w+---. Hence, 


tah = τως )-ἀτ-ως..} 


forw< 1, ΕἸ 


το. 


On the other hand, if w>>1, then e™>e-”. Hence, 
for w>1, tanh w = 1. 
The relation (59) predicts thus the following limiting behavior: 


for poB< KT, ἘΞ 10 (422) = eB (62) 


for poB> kT, B= bo. (63) 


When pioB< KT, the value of ji is rather small. By (62), ji is then 
less than its maximum possible value po by the ratio (upB/kT). Note 
that ji in this limit is simply proportional to the magnetic field B and 
inversely proportional to the absolute temperature T. Using (61) and 
(62), the magnetization then becomes, 


Nowo?B __ 
πες - 


for poB< KT, Mo = Noi = xB (64) 


where x is a constant of proportionality independent of B. This 
parameter x is called the magnetic susceptibility of the substance.t 


# The magnetic susceptibility is conventionally defined in terms of the magnetic field H 
so that x= Mo/H. But since the concentration No of magnetic atoms is supposed to be 
small. H = B to excellent approximation. 


kT 


Fig. 4.12 Dependence of the magnetization 
Mo on the magnetic field B and temperature T 
in the case of weakly interacting magnetic 
atoms of spin 4 and magnetic moment jo. 
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Equation (64) thus provides the following explicit expression for x in 
terms of microscopic quantities: 


N 
kT 


The fact that x is inversely proportional to the absolute temperature is 
known as Curie’s law. 

When pioB > kT, the mean magnetic moment ji attains its maximum 
possible value jo. Correspondingly the magnetization becomes 


for μοΒ kT, Mo = Noo (66) 


which is its maximum possible (or saturation) value and is thus inde- 
pendent of B or T. The complete dependence of the magnetization 
Mp on the absolute temperature T and magnetic field B is shown in 
Fig, 4.12. 


(65) 


4.7 Mean Energy of an Ideal Gas 

Consider a gas of N identical molecules, each of mass m, confined 
within a box of edge lengths L,, Ly, and L,. Suppose that the gas is 
sufficiently dilute, i.e., that the number N of molecules in the given 
volume V = LyLyL, is sufficiently small so that the mean separation 
between the molecules is correspondingly large.. The following two 
simplifying conditions are then satisfied: 


(i) The mean potential energy of mutual interaction between the 
molecules is very small compared to their mean kinetic energy. 
(The gas is then said to be ideal.) 

(ii) It is permissible to focus attention on any one particular mole- 
cule as an entity which can be identified despite the essential 
indistinguishability of the molecules. (The gas is then said to 
be nondegenerate.)+ 


We shall assume that the gas is sufficiently dilute so that both of these 
conditions are satisfied.} 


Ὁ This last conclusion is based on the fact that the mean separation between the mole- 
cules is then large compared to the typical de Broglie wavelength of a molecule. If this 
is not the case, quantum-mechahical limitations make it impossible to focus attention un- 
ambiguously on a particular mBlecule and a rigorous quantum-mechanical treatment of a 
system of indistinguishable particles becomes essential. (The gas is then said to be degen- 
erate and is described by either the so-called Bose-Einstein or Fermi-Dirac statistics.) 

{ Condition (ii) is satisfied by practically all ordinary gases; the extent of its validity will 
be examined more quantitatively at the end of Sec. 6.3. When a gas is made less dilute, 
condition (i) usually breaks down long before condition (ii) is violated. If the interaction 
between molecules is very small, however, a gas may satisfy condition (1) so that it is ideal, 
but violate condition (ii). 


— 
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Suppose that the gas is in equilibrium at an absolute temperature T. 


* By virtue of condition (ii), we can focus attention on a particular 


molecule of the gas and regard it as a small system in thermal contact 
with a heat reservoir (at temperature ΤΊ consisting of all the other 
molecules of the gas. The probability of finding the molecule in any 
one of its quantum states r, where its energy is εν, is then simply given 
by the canonical distribution (49) or (51); i-e., 


P= where g=-l. (67) 
δ᾽ ever kT 


In calculating the energy εν of the molecule, the condition (i) allows 
us to neglect any energy of interaction of the molecule with other 
molecules. 

Consider, for instance, the particularly simple case of a monatomic 
gas [such as helium (He) or argon (Ar)] in which each molecule consists 
of a single atom only. The energy of such a molecule is then simply 
its kinetic energy. Each possible quantum state r of the molecule is 
accordingly labeled by some particular values of the three quantum 
numbers {n,,n,,n,} and has an energy given by Eq. (3.15). Thus 


e= 


πὴ (n2 | mn? ἘΞ) 
or (te + Ps + 75) (68) 


The probability of finding the molecule in any such state is then given 
by (67). 

On the other hand, consider the case of a polyatomic gas [such as 
oxygen (Oz), nitrogen (No), or methane (CH,)] in which each molecule 
consists of two or more atoms. Then the energy ε of each molecule is 
given by 


c= 4 dl, (69) 


Here εἴ is the kinetic energy of translational motion of the center of 
mass of the molecule, while εἴ is the intramolecular energy associated 
with the rotation and vibration of the atoms with respect to the center of 
mass. Since the center of mass moves like a simple particle having the 
mass of the molecule, the state of translational motion of the molecule 
is again specified by the set of three quantum numbers {nz,n,,nz} and 
its translational kinetic energy εἴ is again given by (68). The state of 


* intramolecular motion is specified by one or more other quantum num- 


bers (denote them collectively simply by nj) describing the state of 
rotational and vibrational motion of the atoms in the molecule; the 
energy e depends then on n;. A particular state r of the molecule is 


additonal 


d 
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then specified by specific values of the quantum numbers {nz,n,.n23ni} 
and has a corresponding energy εν given by 


ἐν = Mnznynz) + (ni). (70) 


Note that the translational motion of the molecule is affected by the 
presence of the confining walls of the container; hence « depends on 
the dimensions L;, Ly, Ly of the container, as seen explicitly in (68). 
On the other hand, the intramolecular motion of the atoms with re- 
spect to the molecular center of mass does not involve the dimensions 
of the container; hence εἰ is independent of the dimensions of the 
container, i.e., 


e is independent of L., Ly, Lz. (71) 


Calculation of the mean energy 
If a molecule is found with probability P, in a state r of energy εν then 
its mean energy is simply given by 
Dever bere, 
=> Be= (72) 
“Sem 

where we have used (67) and where the sums are over all possible 
states r of the molecule. The relation (72) can be considerably simpli- 


fied by noting that the sum in the numerator can be readily expressed 
in terms of the sum in the denominator. Thus we can write 


“pre. = — Mei ee we a “Aer 
ae Ep Le 3B (6 8:ὴ = B (Σ: A ) 
where we have used the fact that the derivative of a sum of terms is 


equal to the sum of the derivatives of these terms. Introducing for the 
denominator in (72) the convenient abbreviation 


ZED ew, (73) 


the relation (72) becomes then 


or (74) 


i 
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The calculation of the mean energy é thus requires only the evaluation 
of the single sum Z of (73). (The sum Z.over all states of the molecule 
| is called the partition function of the molecule.) 
In the case of a monatomic gas, the energy levels are given by (68) 
and the sum Z of (73) becomes accordingly + 


Υ̓ Bei (πος πᾶς, at 
B= ms Sev on NE τ) ΓΝ 


Te “Ty Te 


where the triple sum is over all possible values of nz, n,, and ny [each 
ranging over all integral values from 1 to oo in accordance with (3.14)]. 
But the exponential function factors immediately into a simple product 
of exponentials, i.e., 


Bh? (Ξ συ 53}} 
exp[- am ἈΠ + LF ΤῚΣ 


= ow - te] oe [- AS 8 Jon Po 2] 


Here n, appears only in the first factor, ny only in the second, and n, 
only in the third. Hence the sum (75) breaks up into a simple product, 


ie., 


Z = ZLyZe (76) 
where Z= Py exp [- Bev is } (77a) 
Z= om exp [- Both? 15] {ΤῸ 
nm Son[ ABBE er 


It only remains to evaluate a typical sum such as Z,. This is easily 
done if we note that, for any container where L, is of macroscopic size, 
the coefficient of n,? in (77a) is very small unless β is extremely large 
(ie., unless T is extremely small), Since successive terms in the sum 
differ thus only very slightly in magnitude, it is an excellent approxima- 
tion to replace this sum by an integral. By regarding a term in the 


+ To prevent the long expression in the exponent from overwhelming the poor letter ὁ, 
we here use the standard alternative notation exp u =e" to denote the exponential function, 


itd 


0128345678 91011121314 
ny 

Fig. 4.13 Schematic diagram illustrating the 
replacement of a sum over integral values of 
nz (the sum being equal to the area beneath 
the rectangles) by an integral over continuous , 
values of n; (the integral being equal to the 
area beneath the curve). 
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sum as a function of n, (considered as a continuous variable defined 
also for nonintegral values), the sum Z, thus becomes 


Ze = exp|- Bos Jan. 


1/2 2m L,? 
=(A)"(G)ioent-mim a 
svtigre «(2 } (ξ}» (79) 


12 ( L, 
way") 
The lower limit in the last integral of (78) has been put equal to zero 
without noticeable error since the coefficient of τις in (79) is extremely 


small. The last definite integral in (78) is merely equal to some con- 
stant so that (78) is of the form 


(80) 


where b is some constant involving the mass of the molecule.f The 
corresponding expressions for Z, and Ζ, are, of course, analogous 
to (80). Hence (76) yields the result 


2 = (opie )\og)(>pia) 


or Ζ-- 58 pr : where V=L,L,L: (81) 


is the volume of the box. Correspondingly, we obtain 


InZ=InV— #InB + 3Inb. (82) 


Our calculation is now essentially completed. Indeed, (74) yields 
for the mean energy € of a molecule the result 


c= BE -(-4)=3G) 


We have thus arrived at the important conclusion that 
μ᾿ 2 


for a monatomic molecule, 
ε = $kT. (83) 


} Although this will be of no particular importance to us, we note that the last integral 
in (78) has, by (M.21), the value /7/2; thus b= (m/2z)¥h-3, 
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* The mean kinetic energy of a molecule is thus independent of the size 
of the container and simply proportional to the absolute temperature T 
of the gas. 

If the molecules of the gas are not monatomic, the additive expres- 
sion (69) yields for the mean energy of a molecule the result 


€= & «Ὁ εἴ = BAT + HT) (84) 


since the mean kinetic energy of translation εἰ) of the center of mass 

~ is again given by (83). The mean intramolecular energy €” is, by (71), 

independent of the dimensions of the container and thus can only be a 
function of the absolute temperature T. 

Since the gas is ideal so that the interaction between the molecules 

is negligible, the total mean energy E of the gas is simply equal to the 

sum of the mean energies of the N individual molecules, Thus we have 


B=Né. (85) 


~ Even in the most general case, the mean energy of an ideal gas is inde-_ 
_pendent of the dimensions of the container and a function of the 
temperature only; i.e., 


for an ideal gas, 


E=£(r) (86) 


independent of the dimensions of the container. 


+ This result is physically plausible. The translational kinetic energy 
and intramolecular energy of a molecule do not depend on the separa- 
tions between the molecules, Hence a change of the container dimen- 
sions (at the fixed temperature T) leaves these energies unchanged so 

» that E remains unaffected. If the gas were not ideal, this conclusion 
would no longer be true. Indeed, if the gas is sufficiently dense, the 
mean separation between molecules is small enough so that their mean 
potential energy of mutual interaction is appreciable. A change of the 

** container dimensions (at the fixed temperature ΤΊ then results in a 
change in the mean separation of the molecules; correspondingly, it 
affects their mean intermolecular potential energy which is included 
in the total mean energy E of the gas. 


Said 


L, x 


Fig. 4.14 A box containing an ideal gas. A - 
molecule in a given state r exerts on the right 
wall of the box a force F, in the x direction. 
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4.8 Mean Pressure of an Ideal Gas 


It is experimentally quite easy to measure the mean pressure (i.e., the 
mean force per unit area) exerted by a gas on the walls of the container 
within which it is confined. Hence it is of particular interest to cal- 
culate the mean pressure exerted by an ideal gas. Let us denote by F 
the force in the x direction exerted by a single molecule on the right 
wall (i.e., the wall x = L,) of the box containing the gas. Let us 
denote by F, the value of this force when the molecule is in a particu- 
lar quantum state r where its energy is €,. The force F, can be readily 
related to the energy €,. Indeed, suppose that the right wall of the 
box were displaced very slowly to the right by an amount dL,. In this 
process the molecule would do on the wall an amount of work F, dL, 
which must be equal to the decrease —de, of the energy of the mole- 
cule. Hence 


F,dL, = —de, 
ee 
or, F,= aL.’ (87) 


Here we have written a partial derivative to indicate that the other 
dimensions L, and L, were supposed to remain constant in the argu- 
ment leading to (87). 

The mean force F exerted by a molecule on the wall is then given by 
averaging the force F, over all possible states r of the molecule. Thus 


as! RS Ser) 
de ( aL: 
Sieve 
τ 
where we have used the expression (67) for the probability Ρ, of finding 
the molecule in any state r. The relation (88) can be simplified since 


the sum in the numerator can again be expressed in terms of the sum 
in the denominator. Thus the numerator can be written 


Berd Bg) 


τ 


FaD2F= (88) 


“ΠΣ o-(Se *), 
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Using again the abbreviation Z of (73), the expression (88) becomes 


or F==— 


(89) 


δ This general relation can now be applied to the result (82) already 
obtained for In Z in the case of a monatomic molecule. Remembering 
that V = L,LyL,, the partial differentiation yields immediately 


poiamz_1 
Bethe: ως. Bly 


or (90) 
+ If the molecule is not monatomic, the expression (87) for the force 
F, becomes by (70) 
a Fe) 
--- [ = --ὄ. --. 
F, BL, [τ + εἰ] ὍΓς 


Here we have used the fact, noted in (71), that the intramolecular 
energy ¢ does not depend on the dimension L, of the box. Hence 
the force F is calculable from the center-of-mass translational energy 
jalone. The preceding computation, which was based on this transla- 
| tional energy, remains therefore equally valid for a polyatomic mole- 
cule. The expression (90) for F is accordingly a completely general 
result. Υ 
"Since the gas is ideal, the molecules move about without influencing 
+ each other appreciably. Hence the total mean normal force (i.e., force 
in the x direction) exerted on the right wall by all the molecules of the 
gas is simply obtained by multiplying {the mean force F exerted by one 
molecule} by {the total number N of molecules in the gas}. Dividing 
this result by the area L,L, of the wall gives then the mean pressure p 
exerted by the gas on this wall. The relation (90) thus leads to the 
result 
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Hence (91) 


or (92) 


where V = L,LyL- is the volume of the container and where n = N/V 
is the number of molecules per unit volume. Note that (92) makes no 
specific reference to the particular wall which was considered in the 
calculation. The calculation would thus yield properly the same result 
for the mean pressure ἢ exerted on any wall.t ἢ 


Discussion 

The important relations (91) and (92) can be expressed in another 
equivalent form. Thus the total number N of molecules is ordinarily 
deduced from macroscopic measurements of the number » of moles 
of gas present in the container. Since the number of molecules per 
mole is, by definition, equal to Avogadro’s number Νὰ, it follows that 
N=vNq. Hence (91) can also be written in the form 


pV = RT (93) 


where we have introduced a new constant R, called the gas constant, 
defined by the relation 


R=Nok. (94) 


The equation relating the pressure, the volume, and the absolute 
temperature of a substance in equilibrium is called the equation of 
state of that substance. Thus the equations (91) through (93) are dif- 


# Indeed, an elementary (and completely macroscopic) analysis of forces in a fluid in 
mechanical equilibrium shows that the pressure on any element of area must be the same 
anywhere in the fluid (if gravity is neglected) and must be independent of the spatial ori- 
entation of this element of area. 

{ Note concerning Secs. 4.7 and 4.8: 

Although our calculations of the mean energy and pressure have been carried through 
for a gas confined within a container in the simple shape of a rectangular parallelepiped, 
all our results are actually quite general and independent of the shape of the container. 
The physical reason is the following. Ab any ordinary temperature the momentum of a 


molecule is so large that#its de Brogife wavelength is negligibly small compared to the 


dimensions of any macroscopic container. Practically every region within the container 

is thus many wavelengths away from the container walls. Accordingly, the nature of the 

possible wave functions which can exist in such a region is utterly insensitive to the detailed 

boundary conditions imposed at the walls or to details involving the precise shape of these 
ae προ 
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ferent forms of the equation of state for an ideal gas. This equation 
of state, which we have derived on the basis of our theory, makes 
several important predictions: 


(i) If a given amount of gas, sufficiently dilute to be ideal, is main- 
tained at a constant temperature, it follows by (91) that 


pV = constant, 


i.e., that its pressure should be inversely proportional to its volume. 
This result was discovered experimentally by Boyle in 1662 (long be- 
fore the advent of the atomic theory of matter) and is accordingly 
called Boyle’s law. 


(ii) If a given amount of gas, sufficiently dilute to be ideal, is main- 
tained at constant volume, its mean pressure should be proportional 


(gto its absolute temperature. This result can be conveniently exploited, 


as will be shown in the next chapter, as a method for measuring the 
absolute temperature. 


(iii) The equation of state (91) depends only on the number of mole- 
cules, but makes no reference to the nature of these molecules. Hence 
the equation of state should be the same for any gas (whether it be 
He, He, Ne, Oz, CHy, ete.), as long as it is made sufficiently dilute to 
be ideal. This prediction can be checked experimentally and is well 
verified. 


x 


0 


Fig. 4.15 Dependence of the mean pressure 
P of an ideal gas on its volume Vat the absolute 
temperatures Τὶ 27, and 37, 


gee 
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Summary of Definitions 
absolute temperature The absolute temperature T of a macroscopic system [or its re- 
lated parameter β = (KT)] is defined by 


ΕΞ 
ἀπ 


ang 
TOE 


where 9(E) is the number of states accessible to the system in a small energy interval 
between E and E + 8E, and where k is a conventionally chosen constant called Boltz- 
mann’s constant. 

entropy The entropy 8 of a system is defined in terms of the number @ of states acces- 
sible to it by the relation 


S=kinQ. 


The entropy thus provides a logarithmic measure of the degree of randomness of the 
system. 

thermometer A relatively small macroscopic system arranged so that only one of its 
macroscopic parameters changes when the system gains or loses energy as a result of 
thermal interaction. 

thermometric parameter The variable macroscopic parameter of a thermometer. 

temperature of a system with respect to a given thermometer The value of the par- 
ticular thermometric parameter of this thermometer when the latter is in thermal equi- 
librium with the system. 

heat reservoir A macroscopic system which is sufficiently large with respect to a set of 
other systems under consideration so that its temperature remains essentially unchanged 
in any thermal interaction with those systems. 

Boltzmann factor The factor e~#® where is related to the absolute temperature T by 
B= (KT)~1 and where E denotes an energy. 

canonical distribution ‘The probability distribution according to which the probability 
P, of finding a system in a state r of energy E, is given by 


Ῥ, ας en Abr 


where β = (kT)-! is the absolute temperature parameter of the heat reservoir with 
which the system is in equilibrium. 

ideal gas A gas in which the energy of interaction between the molecules is almost 
negligible compared to their kinetic energy. 

nondegenerate gas A gas sufficiently dilute so that the mean separation between mole- 
cules is large compared to the mean de Broglie wavelength of a molecule. 

equation of state The relation connecting the volume, the mean pressure, and the abso- 
lute temperature of a given macroscopic system. 
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Important Relations 
Definition of absolute temperature: 


@) 
Definition of entropy: 
S=king. (i) 


Entropy increase of a system at absolute temperature T when it absorbs a small 
amount of heat dQ: 


as = 40, (ἢ 


Canonical distribution for a system in thermal equilibrium with a heat reservoir 
at absolute temperature T: 


P, x @7bEr, (iv) 
Equation of state of an ideal nondegenerate gas: 
p = nkT. (v) 


Suggestions for Supplementary Reading 

The following books provide alternative derivations of some of the results of this 

chapter: 

C. W. Sherwin, Basic Concepts of Physics, secs. 7.3-7.5 (Holt, Rinehart and Winston, Inc., 
New York, 1961). 

Ὁ. 5. Rushbrooke, Introduction to Statistical Mechanics, chaps. 2 and 3 (Oxford University 
Press, Oxford, 1949). 


ἘΞ C. Andrews, Equilibrium Statistical Mechanics, secs. 6-8 (John Wiley & Sons, Inc., 
New York, 1963). 


Historical and biographical accounts: 

H. Thirring, “Ludwig Boltzmann,” J. of Chemical Education, p. 298 (June 1952). 

E. Broda, Ludwig Boltzmann; Mensch, Physiker, Philosoph (Franz Deuticke, Vienna, 
1955). In German. 

L. Boltzmann, Lectures on Gas Theory, translated by 5, G, Brush (University of California 
Press, Berkeley, 1964). The introduction by Brush describes briefly the historical devel- 
opment of the description of matter in terms of atomic concepts. 

B. A. Leerburger, Josiah Willard Gibbs, American Theoretical Physicist (Franklin Watts, 
Inc., New York, 1963). 

L. P. Wheeler, Josiah Willard Gibbs, the History of a Great Mind (Yale University Press, 
New Haven, 1951; paperback ed., 1962). 

Μ. Rukeyser, Willard Gibbs (Doubleday & Company, Inc., Garden City, N-Y., 1942). 
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Problems 
4.1 Example of a peculiar thermometer 

The density of alcohol, like that of most substances, is found to decrease 
with increasing absolute temperature. The properties of water are, however, 
somewhat unusual. As the absolute temperature is increased above the tem- 
perature at which water melts (i.e., turns from ice into liquid water), its density 
at first increases and then, after going through a maximum, decreases. 

Suppose that an ordinary thermometer, of the type consisting of a liquid 
column within a glass tube, is filled with colored water instead of the colored 
alcohol which is commonly used. The temperature @ indicated by this type of 
thermometer is, as usual, taken to be the length of the liquid column. Imagine 
that this thermometer, when placed in contact with two systems A and B, indi- 
cates that they have respective temperatures 6, and θμ. 

(a) Suppose that the temperature 64 of system A is larger (higher) than the 
temperature 8g of system B. Can one necessarily conclude that heat will flow 
from system A to system B when these two systems are placed in thermal con- 
tact with each other? 

(b) Suppose that the temperatures 0, and 65 of the two systems are found to 
be equal. Can one necessarily conclude that no heat will flow from system A 
to system B when these two systems are placed in thermal contact with each 
other? 


4.2 Value of kT at room temperature 

One mole of any gas at room temperature and atmospheric pressure 
(106 dynes/cm?) is found experimentally to occupy a volume of approximately 
24 liters (1.6., 24 X 103 cm3). Use this result to estimate the value of kT at room 
temperature. Express your answer in ergs and also in electron volts. (1 electron 
volt = 1.60 x 10-1? erg.) 


4.3. Actual variation of the number of states with energy 
Consider any macroscopic system at room temperature. 

(a) Use the definition of absolute temperature to find the percentage increase 
in the number of states accessible to such a system when its energy is increased 
by 10-3 electron volt. 

(b) Suppose that such a system absorbs a single photon of visible light (having 
a wavelength of 5 x 10-5 cm). By what factor does the number of states acces- 
sible to the system increase as a result? 


4.4. Achievement of atomic spin polarization 

Consider a subst&nce which contains magnetic atoms having a spin 4 
and a magnetic moment jig. Since this moment is due to an unpaired electron, 
it is of the order of a Bohr magneton, ie., μο ~ 10.320 erg/gauss. In order to 
do scattering experiments from atoms whose spins are preferentially polarized 
ina given direction, one can apply a large magnetic field B and cool the substance 
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to a sufficiently low absolute temperature to achieve appreciable polarization. 

The largest magnetic field which can conveniently be produced in the labora- 
tory is about 50,000 gauss. Find the absolute temperature T which must be 
attained so that the number of atomic moments pointing parallel to the field 
is at least 3 times as large as the number of moments pointing in the opposite 
direction. Express your answer in terms of the ratio ΤΊ ΤᾺ where ΤῊ is room 
temperature. 


4.5 Proposed method for producing polarized proton targets 

In research in nuclear physics and elementary particles, it is of great 
interest to do scattering experiments on targets consisting of protons whose spins 
are preferentially polarized in a given direction. Each proton has a spin αὶ and 
a magnetic moment po = 1.4 x 10-3 erg/gauss. Suppose that one tries to 
apply the method of the preceding problem by taking a sample of paraffin (which 
contains many protons), applying a magnetic field of 50,000 gauss, and cooling 
the sample to some very low absolute temperature T. How low would this tem- 
perature have to be so that, after equilibrium has been reached, the number of 
proton moments pointing parallel to the field is at least 3 times as large as the 
number of proton moments pointing in the opposite direction? Express your 
answer again in terms of the ratio ΤΊ ΤᾺ where Tp is room temperature. 


4.6 Nuclear magnetic resonance absorption 

A sample of water is placed in an external magnetic field B. Each proton 
of the HzO molecule has a nuclear spin } and a small magnetic moment μο. Since 
each proton can point either “up” or “down,” it can be in one of two possible 
states of respective energies =oB. Suppose that one applies a radio-frequency 
magnetic field of frequency v which is such that it satisfies the resonance condi- 
tion hy = 2yoB, where 2B is the energy difference between these two proton 
states and h is Planck's constant. Then the radiation field produces transitions 
between these two states, causing the proton to go from the “up” state to the 
“down” state, or vice versa, with equal probability. The net power absorbed by 
the protons from the radiation field is then proportional to the difference between 
the numbers of protons in the two states. 

Assume that the protons always remain very close to equilibrium at the abso- 
lute temperature T of the water. How does the absorbed power depend on the 
temperature T? Use the excellent approximation based on the fact that μὴ is 
so small that oB< KT. 


4.7 Relative numbers of atoms in an atomic beam experiment 

Precision measurements of the magnetic moment of the electron were 
instrumental in leading to our modern understanding of the quantum theory of 
the electromagnetic field. The first such precision experiment, performed by 
Kusch and Foley [Physical Review 74, 250 (1948)], was based on a comparison 
of the measured total magnetic moments of gallium (Ga) atoms in two different 
sets of states labeled (in standard spectroscopic notation) as the 2P3/2 and 2P3/2 


“Ὁ 


180 Thermal Interaction 


states, respectively. The 2P;/2 states are the states of lowest possible energy of 
the atom. (There are two such states with the same energy; they correspond 
merely to the two possible spatial orientations of the total angular momentum 
of the atom in this set of states.) The *P3/2 states have an energy higher than that 
of the 2P;/2 states by an amount accurately known from spectroscopic measure- 
ments and equal to 0.102 electron volt. (There are four such states having the 
same energy; they correspond again merely to the four possible spatial orienta- 
tions of the total angular momentum of the atom in this set of states.) 

To perform the desired comparison, the number of atoms in the *P3/2 states 
must be comparable to the number of atoms in the *P3/z states. The atoms can 
be produced by heating gallium in an oven to a high absolute temperature T. A 
small hole in the oven wall then permits a few of the atoms to escape into a sur- 
rounding vacuum where they form an atomic beam on which the actual meas- 
urements are made. 

(a) Suppose that the absolute temperature T of the oven is 37x, where Tp is 
room temperature. What is then the proportion of gallium atoms in the beam 
which are in the *P/2 and P3/9 states, respectively? 

(b) The highest temperature which can be produced conveniently in such an 
oven is about 67. What then is the proportion of gallium atoms in the *P3/ and 
2P5/2 states, respectively? Is this proportion adequate for a successful experi- 
ment? 


4.8 Mean energy of a system with two discrete energy levels 
A system consists of N weakly interacting particles, each of which can 

be in either one of two states with respective energies εἰ and €2, where €; < €2- 

(a) Without explicit calculation, make a qualitative plot of the mean energy 
E of the system as a function of its absolute temperature T. What is E in the 
limit of very low and very high temperatures? Roughly near what temperature 
does E change from its low to its high temperature limiting values? 

(b) Find an explicit expression for the mean energy of this system. Verify 
that this expression exhibits the temperature dependence established qualita- 
tively in (a). 


4.9 Elastic properties of rubber 

A strip of rubber maintained at an absolute temperature T is fastened 
at one end to a peg; from its other end hangs a weight w. Assume the following 
simple microscopic model for the rubber band: It consists of a linked polymer 
chain of N segments joined end to enti; each segment has length a and can be 
oriented either parallel or antiparallel to the vertical direction. Find an expres- 
sion for the resultant mean length L of the rubber band as a function of το. 
(Neglect the kinetic energies or weights of the segments themselves, or any in- 
teraction between the segments.) 
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4.10 Polarization due to impurity atoms in a solid 
The following represents a simple two-dimensional model of a situation 
of actual physical interest. A solid at absolute temperature T contains No nega- 
tively charged impurity ions per unit volume, these ions replacing some of the 
ordinary atoms of the solid. The solid as a whole is, of course, electrically neu- 
tral. This is so because each negative ion with charge —e has in its vicinity one 
positive ion with charge +e. The positive ion is small and thus free to move 
between lattice sites. In the absence of an external electric field it will, there- 
fore, be found with equal probability in any one of the four equidistant sites 
surrounding the stationary negative ion. (See Fig. 4.16; the lattice spacing is a.) 
Ifa small electrical field € is applied along the x direction, calculate the electric 
polarization, i.e., the mean electric dipole moment per unit volume along the 

x direction. ‘i == is 
Fig. 4.16 Impurity atoms in the crystal lattice 
of a solid. 


4.11 Minimum property of the “free energy” for a system 
in contact with a heat reservoir 
When two systems A and A’ are placed in thermal contact, their total 
entropy tends to increase in accordance with the relation (20), i.e., 


AS + AS’ > 0. ῶ 


‘The equilibrium situation ultimately attained after system A has absorbed some 
heat Ὁ = AE thus corresponds to that where the total entropy 8. + S’ of the com- 
bined isolated system is maximum, 
Suppose now that A is small compared to A’ so that A’ acts as a heat reservoir 
at some constant absolute temperature 7’. The entropy change AS’ of A’ can 
then be expressed very simply in terms of AE and Τ΄, Show that (i) implies in 
this case that the quantity F= E — Τ' 8 tends to decrease and becomes a mini- Ξ: 
mum in the equilibrium situation. (The function F is called the Helmholtz free a 
energy of the system A at the constant temperature T’.) 


412 Quasi-static compression of a gas 
Consider a thermally insulated ideal gas of particles confined within a 

container of volume V. The gas is initially at some absolute temperature T. 
Assume now that the volume of this container is very slowly reduced by moving 
@ piston to a new position. Give qualitative answers to the following questions: 

(a) What happens to the energy levels of each particle? 

(0) Does the mean energy of a particle increase or decrease? 

(c) Is the work done on the gas in reducing its volume positive or negative? 

(@) Does the mean energy of a particle, measured above its ground state 
energy, increase or decrease? 

(6) Does the absolute temperature of the gas increase or decrease? 
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(b) By simply averaging, derive an expression for the mean force F in terms 
of the mean energy € of a particle. Use the symmetry requirement that 
ΚΙ = K,? = K. when the gas is in equilibrium. 

(c) Hence show that the mean pressure p exerted by the gas is given by 


pata @) 
where ἃ is the mean energy per unit volume of the gas. 


4.17 Pressure and energy density of electromagnetic radiation 

Consider electromagnetic radiation (i.e., a gas of photons) enclosed in 
a box of edge lengths L,, L,, and L,. Since a photon moves with the speed c of 
light, it is a relativistic particle. Hence its energy ε is related to its momentum 
nK by 


€= AK = A(K2 + ΚΙ + Κρ @) 


where the possible values of K., Ky, and K, are again given by (3.13). 

(a) Use this expression to calculate the force F, exerted by a photon on the 
right wall of the container when the photon is in a given state r specified 
by nz, my, me 

(b) By simply averaging, derive an expression for the mean force F in terms 
of the mean energy @ of a photon. Use the symmetry argument that K,? = 
K,? = KZ when the radiation is in equilibrium with the container walls. 

(c) Hence show that the mean pressure p exerted on the walls by the radiation 
is given by 

p= 48 ῷ 
where ἃ is the mean electromagnetic energy per unit volume of the radiation. 

(d) Why is the constant of proportionality in (ii) equal to 4 instead of being 
equal to the value ἃ derived in the preceding problem for a nonrelativistic gas? 


4.18 Mean energy expressed in terms of partition function 

Consider any system, no matter how complicated, in thermal equilib- 
rium with a heat reservoir at the absolute temperature T = (kB)-1. The proba- 
bility that the system is in any one of its states r of energy E, is then given by the 
canonical distribution (49). Obtain an expression for the mean energy E of this 
system. In particular, show that the arguments used in Sec. 4.7 are generally 
applicable and derive thus the very general result 


p= —2uz [ 
Here ΖΞΕΣ εν (ii) 


represents a sum over all possible states of the system and is called the partition 
function of the system. 


Fea 
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4.19 Mean pressure expressed in terms of partition function 
Consider again the system described in Prob. 4.18. The system is in 

thermal equilibrium with a heat reservoir at the absolute temperature T, but may 
be arbitrarily complicated (e.g., it might be a gas, a liquid, or a solid). For the 
sake of simplicity, assume that the system is confined within a container in the 
shape of a rectangular parallelepiped with edge lengths L., Ly, and L;. Show 
that the arguments used in Sec. 4.8 are generally applicable and establish thus 
the following very general results: 

(a) Show that the mean force F exerted by the system on its right boundary 
wall can always be expressed in terms of the partition function Z of the system 
by the relation 


ΡΞ. Re. @ 


Here Z is defined by the relation (ii) of the preceding problem. 

(b) In the case of any isotropic system, the function Z does not depend on the 
individual dimensions L,, Ly, and L;, but is merely a function of the volume 
V = L,LyL, of the system. Show that (i) implies then that the mean pressure p 
exerted by the system can be expressed in the form 


—. (ii) 


4.20 Partition function of an entire gas 
Consider an ideal gas consisting of N monatomic molecules. 
(a) Write down the expression for the partition function Z of this entire gas. 
By exploiting the properties of the exponential function, show that Z can be 
written in the form 


Z= ZN @ 


where Zo is the partition function for a single molecule and was already calculated 
in Sec. 4.7. 

(b) Use (i) to calculate the mean energy E of the gas by means of the general 
relation derived in Prob. 4.18. Show that the functional form of (i) implies 
immediately that E must be simply N times as large as the mean energy per 
molecule. 

(c) Use (i) to calculate the mean pressure p of the gas by means of the general 
relation derived in Prob. 4.19. Show that the functional form of (i) implies again 
that p must be simply N times as large as the mean pressure exerted by a single 
molecule. ‘i ‘ 


4.21 Mean energy of a magnetic moment 

Consider a single spin } in contact with a heat reservoir at the absolute 
temperature T. The spin has a magnetic moment μο and is located in an external 
magnetic field B. 
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(a) Calculate the partition function Z of this spin. 

(b) Using your result for Z, apply the general relation (i) of Prob. 4.18 to obtain 
the mean energy E of this spin as a function of T and B. 

(c) Verify that your expression for E satisfies the expression E = —jiB where ji 
is the value of the mean component of magnetic moment previously derived in 
Eq. (59). 


4.32 Mean energy of a harmonic oscillator 

A harmonic oscillator has a mass and spring constant which are such 
that its classical angular frequency of oscillation is equal to w. In a quantum- 
mechanical description, such an oscillator is characterized by a set of discrete 
states having energies E, given by 


E, = (n+ Pho. Θ 


The quantum number n which labels these states can here assume all the integral 
values 


n=0,1,2,3,.... (ii) 


A particular instance of a harmonic oscillator might, for example, be an atom 
vibrating about its equilibrium position in a solid. 

Suppose that such a harmonic oscillator is in thermal equilibrium with some 
heat reservoir at the absolute temperature T. To find the mean energy E of this 
oscillator, proceed as follows: 

(a) First calculate the partition function Z for this oscillator, using the defini- 
tion (ii) of Prob. 4.18. (To evaluate the sum, note that it is merely a geometric 
series.) 

(b) Apply the general relation (i) of Prob. 4.18 to calculate the mean energy 
of the oscillator. 

(c) Make a qualitative sketch showing how the mean energy E depends on 
the absolute temperature T. 

(d) Suppose that the temperature T is very small in the sense that kT <<“ hw. 
Without any calculation whatever, using only the energy levels of (i), what can 
you say about the value of E in this case? Does the result you obtained in 
(b) properly approach this limiting case? 

(e) Suppose that the temperature T is very high so that ΚΤ Ὁ» fw. What then 
is the limiting value of the mean energy E obtained in (Ὁ) How does it depend 
on T? How does it depend on w? 


"4.28. Mean rotational energy of a diatomic molecule 


The kinetic energy of a diatomic molecule, rotating about an axis 
Perpendicular to the line joining the two atoms, is classically given by 


where J is the angular momentum and A is the moment of inertia of the mole- 


i 


(b) 


Fig. 4.17 In (a) all the atoms of the solid (in- 
dicated by black circles) are shown in their 
normal positions while the possible interstitial 
positions (indicated by white dots) are unoccu- 
pied. At higher temperatures some of the in- 
terstitial positions are also occupied, as shown 
in (b). 
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cule. In a quantum-mechanical description, this energy can assume the discrete 
values 
ε,-- 514: ἢ @ 
2A 
where the quantum number j, which determines the magnitude of the angular 
momentum J, can assume the possible values 


j=0,1,2,3,.... (ii) 


For each value of j, there are (2j + 1) distinct possible quantum states which 
correspond to the discrete possible spatial orientations of the angular momentum 
vector J. 

Suppose that the diatomic molecule is in a gas in equilibrium at the absolute 
temperature T. To calculate the mean energy of rotation of this diatomic mole- 
cule, proceed as follows: 

(a) First calculate the partition function Z, using the definition (ii) of Prob. 4.18. 
(Be careful to remember that this is a sum containing a term for each individual 
state.) Assume that T is sufficiently large so that kT >> h?/2A, a condition satis- 
fied for most diatomic molecules at room temperature. Show that the sum Z 
can then be approximated by an integral, using u = j(j + 1) as a continuous 
variable. 

(b) Now apply the general relation (i) of Prob. 4.18 to calculate the mean rota- 
tional energy E of the diatomic molecule in this temperature range. 


4.24 Number of interstitial atoms in a solid (approximate analysis) 

Consider a monatomic crystalline solid consisting of N atoms and main- 
tained at an absolute temperature T, The atoms are ordinarily located at the 
normal lattice position indicated by the black circles in Fig. 4.174. An atom 
may, however, also be located at one of the interstitial positions indicated by the 
white dots in this figure. If an atom is in such an interstitial position, its energy 
is larger by an amount € than when it is in a normal position. When the absolute 
temperature is very low, all atoms will, therefore, be in normal positions. When 
the absolute temperature T is appreciable, however, this is no longer the case. 
Suppose that there are N atoms which can be located in N possible normal and 
N possible interstitial positions. The question of interest is then the following: 
At any absolute temperature Τὶ what is the mean number fi of atoms located in 
interstitial positions? An approximate way of solving the problem is the 
following: 

(a) Focus attention first on an individual atom and consider that it can be in 
either one of only two particular positions—one normal, the other interstitial. 
This system can then be in only two possible configurations, call them A and B: 

(A): atom in normal position, 
no atom in interstitial position. 
(B): no atom in normal position, 
atom in interstitial position. 
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What is the ratio Ps/P, of the probabilities Ps and P, of encountering these two 
configurations? 

(b) Focusing attention now on the whole solid, suppose that there are n atoms 
in interstitial positions. Then there must also be 7 atoms absent from normal 
positions. Since any one of the ἢ empty normal positions can be combined with 
any one of the i occupied interstitial positions, a B configuration can arise in any 
of n2 possible ways. According to this argument, the probability Ps of encoun- 
tering a single atom in the solid in a B configuration should be simply proportional 
to #2 if the empty normal and occupied interstitial positions are assumed to be 
distributed at random. Thus Ps ας #2. Show by a similar argument that 
Ῥ, α (N — ἢ). 

(c) Combining the results of parts (a) and (b) and assuming the usual situation 
where i < N, show that 

£ = εταγϑμε͵ (i) 
4.25 Number of interstitial atoms in a solid (exact analysis) 

Consider the physical situation described in Prob. 4.24. To do the 
problem exactly, try to find the probability P(n) that precisely n interstitial posi- 
tions are occupied by atoms. There are then, of course, also n normal positions 
unoccupied by atoms. 

(a) What is the probability of encountering a given situation where n inter- 
stitial atoms are distributed in one particular way and the n empty normal posi- 
tions are also distributed in one particular way? 

(b) In how many ways is it possible to distribute the n atoms over the N pos- 
sible interstitial positions? In how many ways is it possible to distribute the 
n missing atoms over the N normal positions? 

(c) Combining the results of parts (a) and (b), show that 


Fah faa 7 Jeo @ 


(d) The probability P(n) has a sharp maximum for some valuen = fi. To find 
this value ἢ, consider In P(n) and satisfy the condition (0 In P/én) = 0. Since 
all factorials are large, Stirling’s approximation (M.10) is applicable. Show thus 
that, for ἢ <N, one obtains the result 

£ = e-A2p, (ii) 
*4.26 Thermal dissociation of an atom 
‘An ideal gas of atoms is confined within a box with edge lengths Ls, Ly, 
and L,. The whole system is in equilibrium at some absolute temperature T. 
The mass of an atom is M. An atom A can dissociate into a positive ion A* and 
an electron e~ according to the scheme 


AzAt+e 
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An ionization energy u is required to overcome the electron binding and achieve 
ionization. 
Focusing attention on an individual atom, it can thus be in two possible con- 
figurations, call them U and D, 
(U): The atom is undissociated. Its energy E is given by 


E=« 


where ε is the kinetic energy of its center of mass. The 
translational state of motion of the atom can, as usual, be 
specified by the set of quantum numbers {n,,n,,.nz}- 
(D): The atom is dissociated into an electron of mass m anda 
positive ion of mass nearly equal to M (since m <M). 
The interaction between the ion and electron after dis- 
sociation can be assumed to be negligible. The total 
energy of the dissociated system consisting of two sepa- 
rate particles is then equal to 
E=e¢+e+u @) 


where εὖ is the kinetic energy of the ion, «~ is the kinetic 
energy of the electron, and τι is the ionization energy. 
The translational state of this dissociated system can 
then be specified by six quantum numbers, the set of 
quantum numbers {n,*,n,*,n.*} of the ion and the set 
of quantum numbers {n,~,n,~nz~) of the electron. 

(a) Using the canonical distribution find, to within a constant of proportion- 
ality C, the probability Py that the atom is found among those of its states where 
it is in the undissociated configuration (U). 

(b) Using the canonical distribution find, to within the same constant of pro- 
portionality C, the probability Pp that the atom is found among those of its states 


where it isin the dissociated configuration (D). [Note that the required sum over 
all the relevant states is the one already performed in calculating Z in Eq. (81) of 
Sec. 4.7] 


(c) Find the ratio Pp/Py. How does it depend on the temperature T and the 
volume V? 

(d) Now consider the whole gas containing N atoms. Suppose that a mean 
number ἢ of these are dissociated. Then there exist in the box’ ions andi elec- 
trons, while there remain (Ν᾽ — fi) undissociated atoms. A dissociated configura- 
tion of an atom caf then be realized in fi x ἢ = fi? possible ways, an undisso- 
ciated one in (N — Π) possible ways. By approximate reasoning similar to that 
of Prob. 4.24, one can then write 


Pp __ i? πϑ 


Py Non N 


if i<N. Find thus an explicit expression for (fi/N) in terms of the absolute 
temperature T and the density (N/V) of the gas. 
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(6) Ordinarily kT <u, Under these conditions would you expect most atoms 
to be dissociated or not? 

(f) Suppose that KT-< u, but that the volume of the box is made arbitrarily 
large while keeping the temperature T constant. Would most atoms then be 
dissociated or not? Give a simple physical explanation to account for this result, 

(g) The inner part of the sun consists of very hot dense gases, while its outer 
corona is cooler and less dense. Studies of spectral lines of light from the sun 
indicate that an atom may be ionized in the corona, and yet be un-ionized in re- 
gions closer to the sun where the absolute temperature is much higher. How do 
you explain these observations? 


*4.27 Thermal generation of a plasma 

By heating a gas of atoms to sufficiently high temperature, one may 
generate a plasma consisting of an appreciable number of dissociated positive 
and negative charges. To study the practical possibility of this procedure, apply 
the results of Prob. 4.26 to cesium vapor. The cesium atom has a rather low 
ionization energy u = 3.89 electron volts and an atomic weight of 132.9. 

(a) Express the degree of dissociation i/N of Prob. 4.26 in terms of T and the 
mean pressure ἢ of the gas. 

(b) Suppose that one heats cesium vapor to an absolute temperature 4 times 
as large as room temperature and maintains it at a pressure of 103 dynes/cm? 
(i.e., 108 of atmospheric pressure). Calculate the percentage of vapor ionized 
under these conditions. 


4.28 Dependence of energy on temperature for an ideal gas 

The number of states Q(E) of an ideal gas of N monatomic atoms de- 
pends on the total energy E of the gas in a manner derived in Prob. 3.8. Use this 
result and the definition 8 = @ In 0/@E to derive a relation expressing the energy 
E as a function of the absolute temperature Τ' = (kB)-1. Compare your result 
with the expression for E(T) derived in Sec. 4.7. 


4.29 Dependence of energy on temperature for a spin system 

The number of states Q(E) of a system of N spins 4, each having a 
magnetic moment po and located in a magnetic field B, has been calculated in 
Prob. 3.9. 

(a) Use this result and the definition β = (ὃ In 2/@E) to derive a relation ex- 
Pressing the energy E of this system as a function of the absolute temperature 
T= (kp). 

(b) Since the total magnetic moment M of this system is simply related to its 
total energy E, use the answer to part (a) to find an expression for M as a function 
of T and B. Compare this expression with the result derived for Mo in (61) 
and (59). 


ir 
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*4.30 Negative absolute temperature and heat flow in a spin system 

A system consists of N spins 4, each having magnetic moment po and 
located in an external magnetic field B. The number of states 2(E) of this system 
has already been calculated as a function of its total energy E in Prob. 3.9. 

(a) Make an approximate sketch showing the behavior of In Q as a function 
of E. Note that the lowest energy of the system is Ep = —NyoB and its highest 
energy is + NuoB, and that the curve is symmetric about the value E = 0. 

(b) Use the curve of part (a) to make an approximate sketch showing β as a 
function of E. Note that β = 0 for E = 0. 

(c) Use the curve of part (b) to make an approximate sketch showing the 
absolute temperature T ἃ5 ἃ function of E. What happens to T near E = 0? 
What is the sign of T for Ε « 0 and for E> 0? 

(d) Since T suffers a discontinuity near E = 0, it is more convenient to work 
in terms of 8. Show that @8/0E is always negative. Hence prove that, when 
two systems are placed in thermal contact, heat is always absorbed by the system 
with the larger value of 8. Note that this last statement is generally valid for all 
systems, irrespective of whether their absolute temperatures are positive or 
negative. 
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Canonical Distribution in the Classical 
Approximation 


The canonical distribution (4.49) represents a simple result of funda- 
mental importance and exceedingly great practical utility. As we 
showed in Chap. 4, it can be applied directly to calculate the equilib- 
rium properties of the most diverse systems. Thus we illustrated 
specifically how it can be used to derive the magnetic properties of a 
system of spins or to calculate the pressure and specific heat of an ideal 
gas. We also examined several interesting applications in the problems 
at the end of Chap. 4. It would carry us too far afield to discuss the 
wide range of other important applications, a task that might easily fill 
several books. In the present chapter we do, however, want to show 
how some particularly simple and useful results follow immediately 
from the canonical distribution when the approximations of classical 
mechanics are applicable. 


6.1 The Classical Approximation 

We know that the quantum-mechanical| description| of a system of 
particles, under appropriate circumstances, can be approximated by 
a description in terms of classical mechanics. In this section we shall 
want to examine the following two questions: (i) Under what condi- 
tions can a statistical theory in terms of classical concepts be expected 
to be a valid approximation? (ii) If the approximation is permissible, 


[how]can the statistical theory be formulated in classical terms? 


Validity of the classical approximation 

The classical approximation can certainly not be valid if the absolute 
temperature is sufficiently low. Indeed, suppose that the typical ther- 
mal energy kT is less than (or comparable to) the average ing AE 
between the energy levels of the system. It is then very significant 
that the possible energies of the system are quantized so as to be 
separated by discrete amounts. For example, the canonical distribu- 
tion (4.49) implies that the probabilities of finding the system in a state 
of energy E or in a state of next higher possible energy E + AE are 
then very different. On the other hand, if kT> AE, the probabilities 
vary very little from state to state. The fact that the possible energies 


we 
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are discrete rather than continuous then becomes relatively unimpor- 
tant and a classical description might become possible. The definite 
conclusion emerging from these comments is that 


a classical description cannot be valid if 
ΚΤ < AE. [6}] 


The classical approximation should certainly be valid, however, if 
quantum-mechanical effects can be shown to be of negligible impor- 
tance. The fundamental quantum-mechanical limitation on the mean- 
ingful use of classical concepts is expressed by the Hei uncer- 
tainty principle. This asserts that a simultaneous determination of a 
position coordinate q and its corresponding momentum p cannot be 
accomplished to infinite precision, but that these quantities are subject 
to minimum uncertainties of respective magnitudes Aq and Ap such 
that 


Aq Ap >A 6) 


where ἢ =h/2z is Planck’s constant divided by 27. Let us then 

_examine the classical description of a system at a particular tempera- 
ture. To be meaningful, this classical description must be able to con- 
sider a particle of the system localized within some typical minimum 
distance which we shall denote by So. Furthermore, we shall denote 
by po the typical momentum of the particle. If so and po are large 
enough so that 


Sopo>h, 


the limitations imposed by the Heisenberg uncertainty principle should 
become of negligible importance and the classical description should, 
accordingly, be valid. Thus we are led to the conclusion that 


a classical description should be valid 


if Sopo > h, (3a) 
ie, if $0 >%o. (3b) 


Here we have introduced the typical length Xo defined by 


o=—_=>——. 

Po Qa Po 
This is just the de Broglie wavelength h/po divided by 27. The state- 
ment (3D) asserts thus merely that quantum effects ought to be negligi- 
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Fig. 6.1 Classical phase space for a single 


particle in one dimension. Z 
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Fig, 6.2. The two-dimensional phase space of 
the preceding figure is here shown subdivided 
into equal cells of “volume” δ 5p = ho. 
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ble if the minimum significant classical dimension so is large compared 
to the de Broglie wavelength of the particle. The wave properties of 
the particle then clearly become unimportant. προς 


The classical description 
Suppose that a classical discussion of a system of particles is justified. 


+ Then the basic questions that must be examined are precisely the same 


as those which formed the starting point of our quantum-theoretical 
considerations at the beginning of Chap. 3. In particular, the first 
question that arises is the following: How does one specify the micro- 
scopic state of a system described in terms of classical mechanics? 

Let us begin by considering the very simple case of a system con- 
sisting of a single particle moving in one dimension. The position of 


. this particle can be described by a single coordinate, call it g. The 


vo 


fd 


complete specification of the system in classical mechanics then re- 
quires a knowledge of the coordinate q and of its corresponding 
momentum p.f (A simultaneous knowledge of q and p at any one 
time is classically possible. It is also required for a complete descrip- 
tion so that values of q and p at any other time can be predicted 
uniquely in accordance with the laws of classical mechanics.) It is 
possible to represent the situation geometrically by drawing cartesian 
axes labeled by q and p as shown in Fig. 6.1. Specification of q and p 
is then equivalent to specifying a point in this two-dimensional space 
(commonly called phase space). =o 

In order to describe the situation involving the continuous variables 
q and p so that the possible states of the particle are countable, it is 
convenient to follow the procedure of Sec. 2.6 by subdividing the 
ranges of the variables q and p into arbitrarily small discrete intervals. 
For example, one can choose fixed small intervals of size δῷ for the 
subdivision of 4, and fixed small intervals of size 5p for the subdivision 
of p. Phase space is thus subdivided into small cells of equal size and 
of two-dimensional “volume” (i-e., area) ae 


δᾳ 6p = ho 


{If q denotes an ordinary cartesian coordinate and if no magnetic field is present, the 
momentum p is simply related to the velocity τ of the particle of mass m by the proportion- 
ality p = mv. The description in terms of the momentum p rather than the velocity Ὁ is, 
however, valid in more general cases and is thus the one commonly used. 
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interval between q and q + δῷ and that its momentum lies in some 
particular interval between p and p + 6p, i.e., by specifying that the 
pair of numbers {q,p} lies in some particular range. Geometrically 
this specifies that the point represented by {q,p} lies in a particular cell 


of phase space. 


Remark concerning the size of ho 


The specification of the state of the sys- 
tem is clearly more precise the smaller 
one chooses the size of the cells into 
which phase space has been divided, i.e., 
the smaller one chooses hg. This con- 
stant ho can be chosen arbitrarily small 
in a classical description. The correct 
quantum-mechanical description im- 
poses, however, a limitation on the maxi- 
mum accuracy that can be achieved in a 


simultaneous specification of a coordi- 


wy? VS HARA on! 5 vos) at 
nate q and its corresponding momen- 
tum p. Indeed, g and p can only be de- 
termined within uncertainties Ag and Ap 
whose order of magnitude satisfies 
the Heisenberg uncertainty principle 
Δ 4p >. A subdivision of phase 
space into cells of volume less than h is 
thus physically meaningless; i.e, a 
choice of ho « h would lead to a specifi- 
cation of the system more precise than 
is allowed by quantum theory. 


set of f coordinates qu, . . . 


The generalization of the previous discussion to an arbitrarily com- 
plex system is immediate. Such a system can be described by some 
» qeand f corresponding momenta py, . .. , py, 
i.e., by a total of 2f numbers. (As usual, the number f of independent 
coordinates needed for the description of the system is called the num- 
ber of degrees of freedom of the system.) In order to deal with these 
continuous variables in a manner where the possible states of the sys- 
tem are countable, it is again convenient to subdivide the possible 
values of the ith coordinate q; into fixed small intervals of magnitude 
δι, and the possible values of the ith momentum pi into fixed small 
intervals of magnitude 5p;. For each i, the size of the subdivision in- 
terval can be chosen so that the product 


δα; Spi = ho 6) 


where ho is some arbitrarily small constant of fixed magnitude inde- 
Pendent of i. The state of the system can then be specified by stating 
that its coordinates and momenta are such that the set of values 


{91.925 ++ 46PLP2--- P= (ALP) 


lies in a particular set of intervals. In a convenient geometrical inter- 
Pretation this set of values can again be regarded as a “point” in 
a phase space of 2f dimensions where each cartesian axis is labeled 
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by one of the coordinates or momenta.+ The subdivision into in- 
tervals thus divides this space into equal small cells of volume 
(δι δ: - - - Sqr δρι Sp2 - -- δρὴ = ho. The state of the system can 
then be described by specifying in which particular set of intervals 
(ie., in which cell in phase space) the coordinates 41, 42, - - -» 41 and 
momenta pi, p2, -.- prof the system actually lie. For simplicity each 
such set of intervals (or cell in phase space) can be labeled by some 
index r so that all these possible cells can be listed and enumerated in 
some convenient order r = 1,2,3,.... Our entire discussion can then 
be summarized by the observation that 


the state of a system in classical mechanics can be de- 

Qty scribed by specifying the particular cell rin phase space | (6) 
in which the coordinates and momenta of the system 
are found. 


The specification of the state of a system in classical mechanics is 
thus very similar to that in quantum mechanics, a cell in phase space 
in the classical description being analogous to a quantum state in the 
quantum-mechanical description. One distinction, however, is worth 
noting, In the classical case there exists an element of arbitrariness 
since the size of a cell in phase space (ie., the magnitude of the con- 
stant ho) can be chosen at will. On the other hand, in the quantum 
description a quantum state is an unambiguously defined entity (essen- 
tially because quantum theory involves Planck’s constant ἢ which has a 
unique value). 


Classical statistical mechanics 

The statistical description of a system in terms of classical mechanics 
now becomes completely analogous to that in quantum mechanics. 
The difference is one of interpretation: whereas the microstate of a 
system refers to a particular quantum state of a system in the quantum 
theory, it refers to a particular cell of phase space in the classical theory. 
‘When considering a statistical ensemble of systems, the basic postulates 
introduced in the classical theory are the same as the corresponding 


postulates (3.17) and (3.18) of quantum theory. In particular, the 


statement (3.19) of these postulates asserts in classical terms the fol- 


} Except for the fact that it is not so readily visualized by our three-dimensional minds, 
this phase space is completely analogous to the two-dimensional phase space of Fig. 62. 
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lowing: If an isolated system is in equilibrium, it is found with equal 


probability in each one of its accessible states, i.e., in each one of its 


accessible cells in phase space.t 


Example 


In order to illustrate the classical notions 
in a very simple case, let us consider a 
single particle moving in one dimension 
under the influence of no forces, but con- 
fined within a box of length L. If we de- 
note the position coordinate of this par- 
ticle by x, the possible positions of the 
particle are then restricted by a condi- 
tion of the form Ὁ <x<L. The energy 
E of the particle of mass m is merely its 
kinetic energy so that 


aint ie 
Ea gee 52m 


where Ὁ is the velocity and p = mvis the 
momentum of the particle. Suppose 


2 that the particle is isolated and is thus 


i) 


~ known to have 8 constant energy in some 
small range between E and E + dE.- 


Then its momentum must lie in some 
small range dp about the possible values 
p=2+,/2mE. The region of phase 
space accessible to this particle is then 
the one indicated by the dark areas 


shown in Fig. 6.3. If phase space has 
been subdivided into small cells of equal 
size 6x dp = ho, this region contains a 
large number of such cells. These repre- 


sent the accessible states in which the ἡ 


system can be found. 

Suppose that the particle is known to 
be in equilibrium. Then the statistical 
postulate asserts that the particle is 
equally likely to be found with its coordi- 
nate x and momentum p in any one of the 
equal-size cells contained within the dark 
areas. This implies that the particle is as 
likely to have a momentum in the range 
dp near + \/2mE as in the range dp near 
—/2mE. It also implies that the po- 
sition coordinate x of the particle is 
equally likely to lie anywhere within the 
length L of the box. For example, the 
Probability that the particle is located in 
the left third of the box is 4 since the 
number of accessible cells for which x 
lies in the range O < x < Wis one-third 
of the total number of accessible cells. 


The preceding comments make it apparent that any general argu- 
ment based upon the statistical postulates and the counting of states 
must remain equally valid in the classical description. In particular, 
it follows that the derivation of the canonical distribution in Sec. 4.5 
remains applicable. Tf a system A, described classically, is in thermal 
equilibrium with a heat reservoir at the absolute temperature 


+ The statistical postulates can, with certain assumptions, be derived from the laws of 
classical mechanics in the same way that (3.17) and (3.18) could be based upon the laws of 
quantum mechanics. Such a classical derivation shows, incidentally, that a description in 
terms of coordinates and momenta (rather than coordinates and velocities) is the appro- 
Priate one in the most general case. In all the simple cases discussed in this book the dis- 
tinction is trivial, however, since the momentum p and velocity v of a particle of mass m in 
these cases will always be related by the simple proportionality p = mv. 


"" 


"» 


Fig. 6.3 Classical phase 
particle free to move in one dimension and 
confined within a box of length L. The par- 
ticle, specified by a coordinate x and a momen- 
tum p, has an energy in the range between E 
and E + 6E. The states accessible to the par- 
ticle are indicated by the cells contained within 
the dark areas. 


Fig, 6.4 A two-dimensional example of phase 
space subdivided into small cells of equal “‘vol- 
ume” δῷ 6p = ho. The dark region indicates 
an element of volume having a size dq dp and 
containing many cells. 
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T = (kB)~1, the probability P, of finding this system in a particular 
state r of energy E, is thus given by (4.49) so that 


P, & e7PEr, (7) 


Here the state r refers to a particular cell of phase space where the 
coordinates and momenta of A have particular values {q1,..., 4}; 
pi, -++;py}- Correspondingly, the energy E, of A denotes the energy 
E of this system when its coordinates and momenta have these partic- 
ular values, ie., 


E, = E(qi,---+ 96 Ὀν»--»»Ῥὴ (8) 


since the energy of A is a function of its coordinates and momenta. 

It is convenient to express the canonical distribution (7) in terms of 
a probability density by proceeding in the general manner familiar 
from Sec. 2.6. Let us thus try to find the following probability 


P(qis «++ 4; Pir - ++» Pi) Aga + +> ἀφ, ἄραι - ++ apy 

= the probability that the system A in 
contact with the heat reservoir is 
found to have its first coordinate in 
the range between q; and dqi,..., 
its fth coordinate in the range be- (9) 
tween 4; and q; + ἀφ, its first mo- 
mentum in the range between p; and 
Pi + dp;,..., and its fth momentum 
in the range between p;and p; + dp;. 


Here the ranges dq; and dp; are supposed to be small in the sense that 
the energy E of A does not vary appreciably when q; changes by an 
amount dq; or p; changes by an amount dp;. They are, however, sup- 


- posed to be large compared to the intervals used in the subdivision of 


phase space, i.e., dq; > ὅφι and dp; > 6p;. The element of volume 
(dq; - -- dq; dp; --- dp;) of phase space thus contains many cells, each 
of volume (δι - - - δ; δρι --- 5p;) = hol. (See Fig. 6.4.) In each of 
these cells the energy of the system A, and hence also its probability (7), 
is then nearly the same. .Hence the desired probability (9) is found 
simply by multiplying the probability (7) of finding A in a given cell of 
phase space by the total number (dq; - -- dp;)/ho! of such cells, ice., 


Ῥίᾳι, ..-. pi) dan --- ἄρ! & ere, Ma abr 
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or 


Ῥίᾳι, --- «Ῥὴ dqu --- dpy = Ce-PBax--9) dgy -- - dp, (10) 


where C is merely some constant of proportionality (which includes 
the constant ho’). The value of this constant is, of course, determined 
by the normalization requirement that the sum of the probability (10) 
over all accessible coordinates and momenta of the system A be unity, 
ie., that 


{ρίᾳι --- pi) dqa «++ ἄρ, τε ῦ 


where the integral extends over the entire region of phase space acces- 
sible to the system A. Thus it follows immediately that 


C= f CPE?) dgy +++ App. (11) 


These general considerations will be illustrated in the next section 
by applying them to a simple case of great importance, that of a single 
molecule moving in three dimensions. 


6.2 Maxwell Velocity Distribution 


Consider an ideal gas confined within a container of volume V and in 
equilibrium at the absolute temperature T. This gas may consist of 
several different types of molecules. We shall assume that conditions 
are such that a classical treatment of the molecules in the gas is permis- 
sible. At the end of our discussion we shall examine the range of con- 
ditions within which such classical considerations can be expected to 
be valid. Let us think thus in classical terms and focus attention on 
any one of the molecules of the gas. This molecule then constitutes 
a distinct small system in thermal contact with a heat reservoir which 
consists of all the other molecules and has a temperature T. The 
canonical distribution is thus immediately applicable. Suppose, for 
the time being, that the molecule is monatomic. If one neglects any 
external force fields (such as gravity), the energy ε of this molecule 
then is simply its kinetic energy 


ee es 
ca Σπιν τ (12) 
where vy is the velocity and p = mv is the momentum of the molecule 
of mass m. Here we have assumed that the gas is sufficiently dilute 
to be ideal; hence any potential energy of interaction with other mole- 
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cules is supposed to be negligible. The energy of the molecule any- 
where within the container is thus independent of the position vector 
r of the molecule. 

The state of the molecule is described classically in terms of the 
three position coordinates x,y,z of the molecule and its corresponding 
three momentum components p;,py,p:. We can then ask for the prob- 
ability that the position of the molecule lies in the range between r and 
r + dr (i.e., that its x coordinate lies between x and x + dr, its y coordi- 
nate lies between y and y + dy, and its z coordinate lies between z and 
z + dz) and that simultaneously its momentum lies in the range be- 
tween p and p + dp (i.e., that its x component of momentum lies be- 
tween ᾿ς and p; + dpz, its y component of momentum lies between 
py and py + dp,, and its z component of momentum lies between pz 
and pz + dp.). This range of position and momentum variables corre- 
sponds to a “volume” of phase space of size (dx dy dz dp; dp, dp:) = 
dx d3p. Here we have introduced the convenient abbreviations 


dr = dx dy dz 


and dp = ἄρ, ἀρ, ἀρ, ναὸ 
for an element of volume of real space and an element of volume 
of momentum space, respectively. Applying the canonical distribu- 
tion (10), we then immediately obtain for the desired probability 
P(r,p) dr d3p that the molecule has a position between r and r + dr 
and a momentum between p and p + dp the result 


P(r,p) der d3p x e602 dir dp (14) 


where 8 = (kT)“1. Here we have used the expression (12) for the 
energy of the molecule and have written p* = p?. Equivalently we can 
express this result in terms of the velocity ν = p/m of the molecule to 
find the probability P’(r,v) dr dv that the molecule has a position be- 
tween r and r + dr and a velocity between v and v + ἐν. Thus 


[γὼ d3r d3y x e~(1/2)pme dy ov | (15) 


where d3v = dv, dv, dv, and where v? = v?. 

The probability (15) is a very general result which provides detailed 
information about the position and velocity of any molecule in the gas. 
It allows us to deduce readily a variety of more special results. For 


» 
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example, we may ask how many molecules have a velocity in a specified 
range. Or more generally, if the gas consists of a mixture of different 
kinds of molecules having different masses (e.g., helium and argon 
molecules), we may ask how many molecules of a given kind have a 
velocity in any specified range. Fi ‘ocusing attention on the molecules 
of a particular kind, we can thus try to find 


flv) dv = mean number of molecules (of the 
specified kind), per unit volume, which (16) 
have a velocity between ν and v + dv. 


Since the N molecules of the ideal gas move independently without 


-appreciable mutual interaction, the gas constitutes a statistical en- 


semble of molecules of which a fraction given by the probability (15) 
have a position between r and r + dr and a velocity between v and 
v + dy. The mean number f(v) dv of (16) is thus simply obtained by 
multiplying the probability (15) by N, the total number of molecules 
of this kind, and dividing by the volume element dr. Thus 


NP'(r,v) dr d3v 
fv) dv = NEw) ἐδ ὧν 


or flv) Bv = Ο e-G/2)pmv gay an 


where C is a constant of proportionality and 8 = (kT)-1. The result 
(17) is called the Maxwell velocity distribution since it was first derived 
by Maxwell in 1859 (by the use of less general arguments). 

Note that the probability P’ of (15) [or the mean number f of (17)] 
does not depend on the position r of the molecule. This result must, 
of course, be true by virtue of symmetry considerations since a mole- 
cule can have no preferred position in space in the absence of external 
force fields. Note also that P’ (or f) depends only on the magnitude 
of v and not on its direction; i.e., 


fv) = fe) (18) 


1 where » = |v|. Again this is obvious by symmetry, since there is no 


Preferred direction in a situation where the container (and thus also 
the center of mass of the whole gas) is considered to be at rest. 
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Determination of the constant C 


The constant C can be determined by the 
requirement that the sum of (17) over all 
possible velocities must yield the total 
mean number n of molecules (of the kind 
considered) per unit volume. Thus 


Cher 8/200 dy 


οΥ Οὔ ο αμθηρναινεβ εν, θεν») do, doy ἀν, = 


n 


(19) 


Thus Cf ffenA/2meste-A/2amryze~a/2mve? do, dey doz = τι, 


or C ic we TINO doe [Eo timst doy [oer miemeet do, =n. 


Each of these integrals has, by (M.23), 
the same value 


[Teomime dos = ea -(2 
C= "(65 ile 


Hence 


Validity of the results for polyatomic molecules 


Suppose that the gas under considera- 
tion contains molecules which are not 
monatomic. Under the conditions con- 
templated in the preceding paragraphs, 
the motion of the center of mass of such 
ἃ molecule can still be treated by the 
classical approximation, although the in- 
tramolecular motion of rotation and 
vibration about its center of mass must 
ordinarily be discussed in terms of quan- 
tum mechanics. The state of the mole- 
cule can then be described by the posi- 
tion r and momentum p of its center of 
mass, and by specifying the particular 
quantum state s describing its intramo- 
lecular motion. The energy of the mole- 
cule is then 


P 


ae ee 
τας + (22) 
where the first term on the right is the 
kinetic energy of its center-of-mass mo- 
tion and the second term is its intramo- 
lecular energy of rotation and vibration 
inthe states. The canonical distribution 
allows us to write down immediately 
an expression for the probability 
P.(r,p) dr dp that the molecule is found 


in the state where its center-of-mass 
position is between r and r + dr, its 
center-of-mass momentum is between p 
and p + dp, and its intramolecular mo- 
tion is specified by s. Thus 


P,(r,p) dx dp c em Ain* mea") Br dp 
cx e-po/2m dor dip e-M'", 
(23) 


To find the probability P(r,p) d°r d®p that 
the center of mass has a position be- 
tween rand r + dr and a momentum be- 
tween p and p + dp, irrespective of the + 
molecule’s state of intramolecular mo- 
tion, it is only necessary to sum (23) over 
all possible intramolecular states s. But 
since the expression (23) is simply a 
product of two factors, the sum over all 
possible values of the second factor 
merely yields some constant multiplying 
the first factor. The result (23) reduces 7 
thus to an expression of the form (14) ~ 
describing the center of mass of the 
molecule. Hence (15) and the Maxwell 3 
velocity distribution (17) are very general — 
results which remain valid also in de- 
scribing the center-of-mass motion of 8 
polyatomic molecule in 2 gas. 
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6.3 Discussion of the Maxwell Distribution 

The Maxwell velocity distribution ( 17) allows us to deduce immediately 
some related velocity distributions, in particular the distribution of 
molecular speeds in a gas. As we shall see later, some of these results 
can be tested very directly by experiment. Let us explore some of 
these consequences of the Maxwell distribution and then examine the 
conditions under which the Maxwell distribution may be expected to 
be valid. 


Distribution of a velocity component ΄ 
Suppose that we focus attention on the component of a molecule’s 
velocity along a particular direction, say the x direction. Then we are 
concerned with the following quantity describing a given kind of 
molecule, 


g(vz) dv, =the mean number of molecules, per unit volume, 
which have an x component of velocity in the range 
between Ὁ; and v; + dv, (irrespective of the values 
of the other velocity components). 


We obtain this number by simply adding all the molecules which have 
an x component of velocity in this range. Thus 


g(e-) doz =) 


(vy) J (ue) fv) ὧν 


where the sums (i.e., integrations) extend over all the possible y and z 
velocity components of the molecules. Hence (17) yields 


(Uz) doz = οἵ, “fe feet) dv; dv, doz 


co peo 
= Ce-t/mve? dee f e-A/2)0m(052+02) ἄρ, dvz, 
co) 00 


e-(1/2)ames? dy, (24) 


or Βίυ4) dvs 


since the integration over all values of v, and Ὁ» gives merely some con- 
stant which can be absorbed into C’, a new constant of proportion- 
ality.} The constant C’ can again be determined by the requirement 
that the total mean number of molecules per unit volume is properly 
equal to n, i.e., by the condition 


Ws (Oz) dog = C[™ etuaemecs dv, = n. 
laa = 


4 Note that (24) is ἃ simple Gaussian distribution of the kind discussed in Appendix A.1. 


Fig. 6.5 Maxwell distribution showing the 
mean number g(v;) de; of molecules, per unit 
volume, having an x component of velocity 
between ὃς and v, + dv;. 


" 


236 Canonical Distribution in the Classical Approximation 


This gives C= (82), (25) 


The result (24) shows that the velocity component v; is distributed 
symmetrically about the value Ὁς = 0. The mean value of any velocity 
component of a molecule must therefore always vanish, i.e., 


δ: -Ξ 0. (26) 
This is physically clear by symmetry since the x component of velocity 


of a molecule is as likely to be positive as negative. Mathematically 
this result follows from the definition of the averaget 


5.Ξ. [“ ato. 
=F fetes) vz doz. 


Here the integrand is an odd function of v, (ie., reverses its sign when 
Ὅς reverses sign) because g(vz) is an even function of vz (Le., remains 
unchanged under this operation since it depends only on v,). Thus 
contributions to the integrand from +v, and —vz cancel each other. 

Note that g(v.) has its maximum value when τς = 0 and decreases 
rapidly as |vz| increases. It becomes negligibly small when 
| Bmv,*| > 1; that is, 


if |vz| > (KT/m)?, g(vz) > 0. (27) 


The distribution g(v,) thus becomes increasingly sharply peaked near 
τς = 0 if the absolute temperature T is reduced. This merely reflects 
the fact that the mean kinetic energy of a molecule becomes increas- 
ingly small as T > 0. 

Needless to say, exactly similar results hold for the velocity compo- 
nents v, and v; since all velocity components are, by the symmetry of 
the situation, completely equivalent. 


Distribution of molecular speeds 
Considering a given kind of molecule, let us now investigate the 
quantity 
F(v) dv = the mean number of molecules, per unit volume, which 
have a speed v=|y| in the range between Ὁ and 
v+ dv. = 
We can obtain this number by adding all molecules having speeds in 
this range, irrespective of the directions of their velocity. Thus 
F(v) dv = f Το) By (28) 


} The average can here be written as an integral in the manner of Eq. (2.78). 
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where the prime on the integral indicates that the integration is over 
all velocities satisfying the condition that 


v< |v] “υ- ἀν, 


ie., over all velocity vectors which terminate in velocity space within 
a spherical shell of inner radius v and outer radius Ὁ + dv. Since dv is 
infinitesimal and f(v) depends only on the magnitude of ν, the function 
ΚΝ) has essentially the constant value f(v) over the entire domain of 
integration of (28) and can thus be taken outside the integral. The re- 
maining integral represents merely the volume in velocity space of a 
spherical shell of radius » and thickness dv, a volume equal to the 
area ποῦ of this shell multiplied by its thickness dv. Hence (28) 


becomes simply 
Flv) dv = ἀπ) v2 ἀ0. (29) 


Using (17) this becomes explicitly 


F(v) dv = 4πῸ e-A/2)2mv2p2 dp (30) 


where C is given by (20). The relation (30) is the Maxwell distribution 
of speeds, Note that it has a maximum for the same reason that is 
responsible for the maxima encountered in our general discussion of 
statistical mechanics. As v increases, the exponential factor decreases, 
but the volume of phase space available to the molecule is proportional 
to v? and increases; the net result is a gentle maximum. 


£F(o) ἢ 


04} 


Fig. 6.7 Maxwell distribution showing the 02 
mean number F(v) dv of molecules, per unit 
volume, having a speed between Ὁ and v + dv. 
The speed v is here expressed in terms of the 
most probable speed = (2kT/m)/2. Also 
shown are the mean speed @ and the root- 
mean-square speed Um: = (62)1/2. 


Fig. 6.6 Velocity space indicated in two di- 
mensions, the Ὁ, axis pointing out of the paper. 
The spherical shell contains all molecules hav- 
ing a velocity ν such that Ὁ < |v| « υ + dv. 
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Of course, if F(v) dv is summed over all possible speeds Ὁ = |v|, 
the result must again be the total mean number n of molecules per unit 
volume, i.e., 

Fle) do = n. (31) 
+ The lower limit of the integral reflects the fact that, by its definition, 
the speed v = |p| of a molecule cannot be negative. 

A plot of F(v) as a function of the speed v is shown in Fig. 6.7. The 
particular speed v = 6 where F(v) has its maximum value is called the 


Fig. 6.8 Behavior of the Maxwell distribution 3 : 
of molecular speeds as a function of tempera- most probable speed. It can be found by setting dF/dov = 0. By using 


ture. (30), this condition becomes 


(— Bmw e~/2)0mv2)p2 4 -A/20M*(20) = Ὁ 


so that b= ves = f=. (32) 


3 Let us, for example, consider nitrogen (Nz) gas at room temperature 
so that T~ 300°K. Since the molecular weight of No is 28 and 
Avogadro’s number is 6 X 1023 molecules/mole, the mass of a Nz 
molecule is m ~ 28/(6 χ 1023) ~ 4.6 x 10-23 gm. Hence (32) yields 
for the most probable speed of such a Ne molecule 


b= 42 Χ 105 cm/sec = 420 m/sec, (33) 


a number of the order of the speed of sound in the gas. 


.2- Validity of the classical discussion of a gas 
Now let us examine under what conditions our classical discussion of 
the ideal gas, and hence also the Maxwell velocity distribution, can be 
+ \ expected to be valid. Our criterion of validity is the condition (3) fol- 
lowing from the Heisenberg uncertainty principle. If the condition 
+ (3) is satisfied, the classical description ought to be adequate since it 
makes no statements that might violate limitations imposed by quan- 
tum ideas. 

2 Since we are merely interested in typical orders of magnitude, we 
can be content with approximate estimates of the pertinent quantities 
in (3). The typical magnitude po of the momentum of a molecule of 
mass m ina gas at temperature T can be found from the most probable 
speed ὃ of such a molecule. Thus, by (32), 


po = mb = \/2mkT. 


* The corresponding typical de Broglie wavelength Xo of the molecule 


ὄ 
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is then 5 at σαν, ΞΕΘΣ (34) 

The classical description considers the molecules as distinguishable 
particles traveling along well-defined trajectories. This point of view 
ought certainly to be valid if there exist no quantum mechanical limita- 
tions preventing a molecule from being localized within a distance no 
greater than the typical separation so between nearest molecules. This 
requires, in accordance with (3), that 


(35) 


(The quantum-mechanical discussion shows that quantum effects do 
indeed become significant when the condition (35) is violated, pre- 
cisely because the essential indistinguishability of the molecules is then 
of paramount importance). To estimate the typical separation so be- 
tween nearest molecules, imagine each molecule to be at the center 
of a little cube of side so, these cubes filling the volume V available to 
the gas consisting of N molecules. Then 


or 80 = (x) Ξε πη" (36) 


where n = N/Vis the number of molecules per unit volume. The con- 
dition (35) for the validity of the classical approximation thus becomes 
ΣΝ nis 
— zh <1. (37 
So ‘QmkT < (37) 
This shows that the classical approximation ought to be applicable if 
the gas is sufficiently dilute so that n is small, if the temperature T is 
sufficiently high, and if the mass m of a molecule is not too small. 


Numerical estimates 


To estimate the typical magnitudes nu- molecular mass m 
merically, consider helium (He) gas at 
room temperature and atmospheric pres- 
sure (760 mm Hg). The relevant param- 
eters are then: 


4 


=; = 6: 10-24 gm. 
poe ee 


The equation of state of an ideal gas gives 
mean pressure Ὁ 

= 760 mm Hg = 105 dynes/em?; 
temperature T 

= 300°K; hence kT = 4.1 x 10— erg; 


— tb = 25 X 1019 molecules/em’, 


Hence (34) and (36) yield the estimates 


pj FA. 3.5. 


Fig. 6.9 James Clerk Maxwell (1831-1879). 
Although best known for his fundamental work 
in electromagnetic theory, Maxwell also con- 
tributed significantly to macroscopic thermo- 
dynamics and to the atomic theory of gases. 
He derived the molecular velocity distribution 
in 1859. After spending the earlier part of his 
career at the University of Aberdeen in Scot- 
land, Maxwell became professor at Cambridge 
University in 1871. (From G. Holton and 
D. Roller, “Foundations of Modern Physical 
Science,” Addison-Wesley Publishing Co., Inc., 
Cambridge, Mass., 1958. By permission of the 
publishers.) 
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Xo =O14A 
80 = 33 ἃ 


where 1A = 10- cm. Here the condi- 
tion (35) is well satisfied and the classical 
approximation ought to be very good. 
Most gases have larger molecular weights 
and thus smaller de Broglie wavelengths; 
the condition (35) is then even better 
satisfied. 

On the other hand, consider the con- 
duction electrons in a typical metal such 
as copper. In first approximation, inter- 
actions between these electrons can be 
neglected so that they can be treated as 
an ideal gas. But the numerical values 
of the significant parameters are then 
quite different. The mass of an electron 
is very small, about 10-27 gm or 7300 
times less than that of a He atom. This 
makes the de Broglie wavelength of the 


and 


electron much longer, 
Xo = (0.14) x 7300 = 12 A. 


Furthermore, since there is about one 
conduction electron per atom in the 
metal and since a typical interatomic 
spacing is about 2 A, 


So = DA. 


The distance between particles is thus 
much smaller than in the case of He gas, 
i.e., the electrons in a metal form a very 
dense gas. These estimates show that 
the condition (35) is not satisfied by the 
electrons in a metal. Hence there exists 
no justification for discussing such elec- 
trons by classical statistical mechanics. 
In fact, it is then essential to give a com- 
pletely quantum mechanical treatment 
which takes into account the Pauli exclu- 
sion principle obeyed by the electrons. 


6.4 Effusion and Molecular Beams 


Consider a gas in equilibrium inside some container. Suppose that a 
small hole of diameter D (or a narrow slit of width D) is now made in 
one of the walls of this container. If the hole is sufficiently small, the 
equilibrium of the gas inside the container should be disturbed to a 
negligible extent. The few molecules escaping through the hole into 
a vacuum surrounding the container should then constitute a repre- 
sentative sample of the molecules of the gas in its equilibrium state. 
Indeed, the molecules that have thus escaped can be collimated by 
slits to form a well-defined beam and, since they are few in number, 
their mutual interaction in this beam is insignificant. The molecules 
in such a beam can then be studied very effectively with two possible 
aims: (i) It may be of interest to study the properties of the molecules 
in the gas in equilibrium inside the container. For example, one may 
want to check whether the molecules in the container have velocities 
distributed in accordance with the predictions of the Maxwell distri- 
bution. (ii) It may be of interest to study the properties of essentially 
isolated molecules or atoms in order to investigate fundamental atomic 
or nuclear properties. Several Nobel prizes attest to the fruitfulness 
of this technique. We need only mention the fundamental experi- 
ments of Stern and Gerlach which led to the discovery of the spin and 


10 
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associated magnetic moment of the electron, those of Rabi and co- 
workers which led to precision measurements of nuclear magnetic 
moments, and those of Kusch and Lamb which helped to lead to our 
modern understanding of the quantum theory of electromagnetic 
interactions. ἢ 

How small must be the size D of the hole so that the equilibrium of 
the gas inside the container is disturbed to a negligible extent? The 
hole must be sufficiently small so that the few molecules which are in 
the vicinity of the hole (and thus manage to escape through it) do not 
affect appreciably the large number of molecules in the remainder of 
the gas. This condition is satisfied if, during the time that a molecule 
spends in the vicinity of the hole, it suffers practically no collisions with 
other molecules. But if the mean speed of a molecule is δ, the time 
that a molecule spends near the hole is of the order of D/é. On the 
other hand, the time between successive collisions of a molecule with 
other molecules is of the order of 1/é if 1 denotes the mean free path 
of a molecule in the gas.{_ The preceding condition is thus equivalent 
to the statement that 


Fig. 6.10 Formation of a molecular beam by 
molecules escaping through a small hole in 
container. = 


D ul 
Ἐπ τ 


or D<l (38) 


If this condition is satisfied, the molecules inside the container remain 
essentially in equilibrium (although their total number decreases slowly) 
and the escape of the molecules through the hole is said to constitute 
effusion. 


Remark 


The situation is appreciably different if 
D> Iso that molecules suffer frequent 
collisions with each other near the hole. 
When some molecules emerge through 
the hole (as in Fig. 6.10), the molecules 
behind them are then appreciably af- 
fected. They no longer continue collid- 
ing with the molecules on the right which 
have just escaped through the hole, but 
they still suffer frequent collisions with 


the molecules on the left. These colli- 
sions cause the molecules near the hole 
to experience an unbalanced force to the 
right and thus to acquire a net velocity in 
the direction toward the hole. The re- 
sultant motion of all these molecules 
moving together as 8 group is then analo- 
gous to the fiow of water through the hole 
ofa tank. In this case one has not effu- 


sion, but hydrodynamic flow. 


1 A good and readable review of molecular beam experiments can be found in an article 
by O. R. Frisch in Sci, American, vol. 212, p. 58 (May 1965). 

1 As discussed in Sec. 1.6, the mean free path is defined as the mean distance traveled by 
# molecule in the gas before it collides with some other molecule. 


Topumps 


Fig. 6.11 A molecular-beam apparatus for 
studying the velocity distribution of silver (Ag) 
atoms. The Ag atoms stick to the drum sur- 
face upon impact. 


242 Canonical Distribution in the Classical Approximation 


Slow Ag atoms 


| | deposited here 
a aa 


eat EES Fast Ag atoms 
— | ail deposited here 


aa 2 
Collimating slits Rotating drum 


Topumps 

If the hole is small enough so that the condition (38) is satisfied, the 
equilibrium of the gas is not affected appreciably by the presence of 
the hole. Accordingly the mean number 7 of molecules escaping per 
unit time through the hole is the same as the mean total number of 
molecules that would, per unit time, strike the area occupied by the 
hole if this hole had never been made. Hence 70 is simply given by 
the approximate expression (1:18) obtained in Sec. 1.6, i.e. 


Fo = nb (39) 


where n is the mean number of molecules per unit volume and @ is their 
mean speed.{ If one wants to focus attention only on those molecules 
having a speed between v and v + dv, the mean number 7(v) do of 
such molecules escaping through the hole per unit time is, analogously 
to (39), approximately given by 


F(v) dv = ἐ[Ἐ{0) 40]0 (40) 


where F(v) dv denotes the mean number of molecules having a speed 
between v and v + dv. Using the Maxwell distribution of speeds (30), 
one then obtains the proportionality 


F(v) do x p3e-/2)Ame, (41) 


The last factor v in (40) expresses merely the fact that a fast molecule 
is more likely to escape through the hole than a slow one. 

By measuring the relative number of molecules having various 
speeds in the molecular beam emerging from the hole in the container, 


+ An exact calculation yields, instead of (39), the result 70 = 4nd, See Appendix A4 
where the exact calculation is discussed. 


Sec. 6.4 243 


_——T~_Detector 
Disks__--— ae 


/ 


Source! 


foe es wal 


r 
\ 
\ 


To pumps 


one can test the prediction (41) and thus the Maxwell distribution upon 
which it is based. One experimental arrangement for achieving this 
purpose is shown in Fig. 6.11. Here silver is heated in an oven to gen- 
erate a gas of silver (Ag) atoms, some of which emerge through a nar- 
row slit to form an atomic beam. A hollow cylindrical drum, which 
has a slit in it and which rotates rapidly about its axis, is located in front 
of the beam. When the Ag atoms enter the slit in the drum, they re- 
quire different times to reach the opposite side of the drum, a fast atom 
requiring less time than a slow one. Thus, since the drum is rotating, 
Ag atoms with different speeds strike the inside surface of the drum 
at different places and then stick to it. A subsequent measurement of 
the thickness of the deposited silver layer as a function of distance 
along the inside drum surface thus provides a measurement of the 
atomic velocity distribution. 

A more accurate method for determining the velocity distribution 
uses a device which selects molecules having a particular velocity, (See 
Fig. 6.12. The method is analogous to the toothed-wheel method used 
by Fizeau to measure the speed of light.) In this method the molecular 
beam emerges from a hole and is detected at the other end of the 
apparatus. The velocity selector, which is placed between the source 
and the detector, consists in the simplest case of a pair of disks mounted 
on a common axle that can be rotated with known angular velocity. 
Both disks are identical and each has a slot cut into its periphery. The 
rotating disks act thus as two shutters which are alternately opened 


Fig. 6.12 A molecular-beam apparatus for 
studying the molecular velocity distribution by 
a velocity selector, By the time the beam of 
molecules reaches the second disk, the slot in 
this disk has ordinarily moved to a position 
such that the beam cannot pass through this 
disk. An exception occurs only if this disk has 
rotated by one revolution (or an integral num- 
ber of revolutions) in the time required by the 
molecules to traverse the distance between the 
two disks. (A more effective velocity selector 
results if more than two similar disks are 
mounted on the same axle.) 
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Fig. 6.13 Photograph of a modern molecular- 
beam apparatus used to study the properties 
(Photo- 


Ramsey, He 
of hydrogen molecules and atoms. 
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graph by courtesy of Professor Norman F. 
Harvard University.) 
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Fig. 6.14 Schematic diagram showing the essential ingredients 
of the molecular-beam apparatus of the preceding photograph. 
The source of molecules is the container 8, while D denotes some 
kind of device detecting the molecules arriving at the other end 
of the apparatus, The inhomogeneous magnetic fields produced 


με | ' { | 


Main chamber 


To pump To pump To pump 
by the A and B magnets exert forces on the very small magnetic 
moments of the molecules and thus deflect them along the indi- 
cated paths. The experiments detect the effect of radio-fre- 
quency radiation acting on the molecules in the experimental 


region of the C magnet. 
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and closed. When the disks are properly aligned and not rotating, all 
molecules can reach the detector by passing through the slots in both 
disks, But when the disks are rotating, molecules passing through the 
slot in the first disk can only reach the detector if their velocity is such 
that the time of flight required for them to travel to the second disk is 
equal to the time required for one revolution of the disks (or an integral 
number of such revolutions).}_ Otherwise they strike the solid part of 
the second disk and are stopped. Hence different angular velocities 
of rotation of the disks allow molecules of different speeds to reach the 
detector. Measurements of the relative number of molecules arriving 
at the detector per second thus allow a direct check of the molecular 
velocity distribution. The validity of the Maxwellian distribution has 
been well confirmed by such experiments. 

The phenomenon of effusion also has several practical applications 
outside the realm of molecular beams. Returning to the relation (39), 
note that a knowledge of the absolute temperature T and mean pres- 
sure p of the gas allows us to calculate n and δ. Thus the equation of 
state of an ideal gas gives n = p/kT. Furthermore, the mean speed 5 
of a molecule is approximately equal to its most probable speed (32); 
thus 6 « (kT/m)/?, Hence (39) yields 


σέ Bor. (42) 
The rate of effusion 7o is seen to depend on the mass of the molecule 
because a lighter molecule has a higher mean speed than a heavier one 
and thus effuses more rapidly. This property can be exploited to 
achieve a practical method for the separation of isotopes. Suppose 
that a container is closed off by a membrane which has very many small 
holes through which molecules can effuse, If this container is sur- 
rounded by a vacuum on the outside and is filled with a gas mixture of 
two isotopes at some initial time, then the relative concentration of 
the isotope of larger molecular weight will increase in the container as 
time goes on. Similarly, the gas pumped off from the surrounding 
vacuum will be more concentrated in the lighter isotope. This method 
of isotope separation has been of practical importance in obtaining 
uranium rich in 2351), the isotope which undergoes ready nuclear fission 
and is thus of great importance for the operation of nuclear power 
reactors (or the manufacture of nuclear weapons). Ordinary uranium 
consists mostly of the isotope 238U, but by using the chemical com- 
pound uranium hexafluoride (UFs), which is a gas at room tempera- 


{ Experimentally one can readily distinguish between observations corresponding to 
such different integral numbers. 
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ture, one can separate by effusion the slightly lighter 235UFg molecules 
from the much more abundant and slightly heavier 25SUF 5 molecules. 
Since the difference in mass between these molecules is so small, the 
process of effusion must be repeated many times in succession to 
achieve an appreciably increased concentration of the 2351 isotope. 


6.5 The Equipartition Theorem 
The canonical distribution in its classical form (10) is a function of 
coordinates and momenta which are continuous variables. Any cal- 
culation of mean values can thus be reduced to a calculation of integrals 
rather than of discrete sums. Under certain conditions the mean 
energy of a system can then be computed in a particularly simple way. 
To be specific, consider any system described classically in terms of 
f coordinates qi, ... , q¢and f corresponding momenta pi, . - « , py- Its 
energy E is then a function of these variables, ie., E = E(qu, . . - ,p/)- 
It is frequently the case that this energy is of the form 


E = a(pi) + E(qa,--- Pi) (43) 


where εἰ is a function of the particular momentum p; only and where Ε΄ 
may depend on all coordinates and momenta except p;. (For example, 
the functional form (43) may arise because the kinetic energy of a 
particle depends only on its momentum components while its potential 
energy depends only on its position.) Suppose that the system under 
consideration is in thermal equilibrium with a heat reservoir at an 
absolute temperature T. What then is the mean value of the energy 
contribution « in (43)? 

The probability of finding the system with its coordinates and 
momenta in a range near {q1, . . . .qpP1,-- - pr} is given by the canon- 
ical distribution (10) with the constant C determined by (11). By its 
definition, the mean value of ¢; is found by performing the appropriate 
sum (or integral) over all possible states of the system, i.e., 

= fer AB) εἰ αι -+- apy 
σε = ἐπε, αρη ἴηι == dp; 
where the integrals extend over all possible values of all the coordinates 
qa, ---,qy and all the momenta pi,...,p/- By virtue of (43), the ex- 
pression (44) becomes Ε΄ 
= _ fen Mei t®) ε, day --- dpy 
σι τὸ Te PatB dq, --- dp; 
το Se~Poses dpi [6558 ἀφ --- dpy 
feb dp; [΄ 6 PF dq --- dp; 


(44) 
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where we have used the multiplicative property of the exponential 
function. The primes on the last integral signs indicate that these in- 
tegrals extend over all the coordinate ᾧ and momenta p except p;. But 
since the primed integral in the numerator is the same as that in the 
denominator, the resulting cancellation leads immediately to the simple 
result 


=, — Soha dpi (45) 


ἕξι τε Som dps” 


In short, since εἰ involves only the variable p;, all the other variables 
are irrelevant in calculating the mean value of εἰ. 

The expression (45) can be simplified further by relating the integral 
in the numerator to that in the denominator. Thus 


or ἔπ -- in (“5 ἀρι) (46) 


where the explicit limits in the integral reflect the fact that the momen- 
tum pj; can assume all possible values from —oo to +00. 

Consider now the specific case where εἰ is a quadratic function of pi, 
as it would be if it represents a kinetic energy. In short, suppose that 
εἰ is of the form 


«= bp? (47) 
where b is some constant. Then the integral in (46) becomes 
oo eo 00 
flee dp; = f a Pht dpi = pwns" «εν dy 
where we have introduced the variable y= B/2p;._ Hence 
in (“5 ἀρι) ΠΝ ὙΠῸ ([..»» dy). 


But the integral on the right does not involve ββ at all. The differentia- 
tion of (46) yields thus simply 


a= 


or (48) 


Note that, although the starting point (44) of our calculation involved 


ap 


te 
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a formidable array of integrals, we were able to derive the final result 
(48) without evaluating a single one. 

If the functional forms (43) and (47) had been the same except for 
involving a coordinate qj instead of a momentum pj, all our previous 
arguments would be identical and would thus lead again to (48). 
Hence we have established the following general statement, known as 
the equipartition theorem: 


If a system described by classical statistical mechanics is 
in equilibrium at the absolute temperature T, every in- (48a) 
dependent quadratic term in its energy has a mean value 
equal to $kT. τ 


6.6 Applications of the Equipartition Theorem 

Specific heat of a monatomic ideal gas 

The energy of a molecule in such a gas is simply its kinetic energy (12), 
ie., 


e= be + py? + pe?). (49) 


By virtue of the equipartition theorem, the mean value of each of the 
three terms in this expression is equal to ἀκτ. Hence it follows imme- 
diately that 


@ = kr. (50) 
Since one mole of gas consists of Avogadro’s number N, of molecules, 
the mean energy of the gas is thus, 
per mole, E = N,($kT) = RT (51) 


where R= Νὰ is the gas constant. By virtue of (5.23), the molar 
specific heat cy at constant volume is then equal to 


ἘΞ (4), =3R. (52) 


This result agrees with that previously obtained in (5.26) on the basis 
of quantum-mechanical reasoning applied to a gas sufficiently dilute 
to be ideal and nondegenerate.+ 


+ In accordance with (37), quantum effects should indeed be unimportant for a suffi- 
ciently dilute gas. The agreement between the classical and quantum results is thus to be 
expected. 
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Kinetic energy of a molecule in any gas 
Consider any gas, not necessarily ideal. The energy of any molecule 
of mass m can then be written in the form 


εΞ H+, where ¢*) = oe + py? + pz). 


The first term is the kinetic energy of the molecule and depends on 
the momentum components pz, py, pz, of its center of mass. The term 
ε΄ may involve the position of the center of mass of the molecule (if the 
molecule is located in an external force field or if it interacts appre- 


- ciably with other molecules); it may involve coordinates and momenta 


describing the rotation or vibration of the atoms of the molecule with 
respect to its center of mass (if the molecule is not monatomic); but it 
does not involve the center-of-mass momentum p. Hence the equi- 
partition theorem allows us again to conclude immediately that 


saps = dmv = Sir (53) 
or of = {KE (54) 


m 


Since ὃς = 0 by symmetry, as already mentioned in (26), the result (54) 
represents also the dispersion (Av,)? of the velocity component v,. 
The three quadratic terms of the kinetic energy e then yield for its 
mean value, as in (50), the result 


Mann (65) 


Brownian motion ἢ 
Consider a macroscopic particle of mass πὶ (about a micron in size) 
suspended in a fluid at an absolute temperature T. The energy of this 
particle can again be written in the form 


e= 


Sob? + py? + 2) Ὁ €. 

Here the first term is the kinetic energy involving the velocity v or 
momentum p = mv of the center-of-mass motion of the particle, while 
τ΄ is the energy associated with the motion of all the atoms of the par- 
ticle with respect to its center of mass, The equipartition theorem 
leads then again to the results (53) and (54) so that 


of =A, (56) 


+ 
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Since the mean value 6; = Ὁ vanishes by symmetry, (56) gives directly 
the dispersion of the velocity component Ὁ,. The result (56) thus 
shows immediately that the particle does not simply remain at rest, but 
must always exhibit a fluctuating velocity, The existence of the 
phenomenon of Brownian motion, discussed in Sec. 1.4, is thus an 
immediate consequence of our theory. The quantitative statement 
(56) also shows explicitly that, if the mass m of the particle is sufficiently 
large, the fluctuations become so small as to be unobservable. 


᾿ Harmonic oscillator 


tps t= 


"- 


Is 


Consider a particle of mass m performing simple harmonic oscillations 
in one dimension. Its energy is then given by 
1 


1 2 a 2 
By Pe + yr. (57) 


ex, 


Here the first term is the kinetic energy of the particle whose momen- 
tum is denoted by p,. The second term is its potential energy if a dis- 
placement x of the particle produces a restoring force — ax, where a is 
a constant (called the spring constant). Suppose that the oscillator is 
in equilibrium with a heat reservoir at a temperature T high enough so 
that the oscillator can be described in terms of classical mechanics. 
Then the equipartition theorem (48) can be immediately applied to 
each of the quadratic terms in (57) to give for the mean energy of the 
oscillator 


é = ΤῸ HAT = KT. (58) 


6.7 The Specific Heat of Solids 

As a last application of the equipartition theorem, we shall discuss the 
specific heat of solids at temperatures sufficiently high so that a classical 
description is valid. Consider thus any simple solid consisting of 
N atoms; e.g., the solid might be copper, gold, aluminum, or diamond. 
By virtue of the mutual forces between neighboring atoms, the situa- 
tion of stable mechanical equilibrium of the solid is that in which its 
atoms are located at regular positions in a crystal lattice. Each atom 
is, however, free to move by small amounts about its equilibrium posi- 
tion. The force (due to neighboring atoms) tending to restore the atom 
to its equilibrium position vanishes, of course, when the atom is actually 
located at its equilibrium position. Since the displacement of the atom 


= 


1 


Ins 


is 
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from its equilibrium position is always quite small, the restoring force 
must, in first approximation, be simply proportional to the atomic dis- 
placement. This approximation, which is usually excellent, thus im- 
plies that the atom performs about its equilibrium position simple 
harmonic motion in three dimensions. 

With proper choice of orientation of the x, y, z coordinate axes, the 
motion of an atom along any one of these axes, say along the x direc- 
tion, is then a simple harmonic motion with an associated energy ες of 
the form (57), ie., 


ee Se ae τς 
= ape + yar. (59) 


Here p; denotes the x component of momentum of the atom while x 
denotes the x component of displacement of the atom from its equilib- 
rium position. In writing (59), we have supposed that the atom has 
a mass m and is subject to a restoring force of spring constant «. Cor- 
respondingly, the (angular) frequency of oscillation of the atom along 
the x direction is then given by 


o= νξ- (60) 


τ, Similar expressions hold for the energies εὐ and ες associated with the 


motion of the atom in the y and z directions. The total energy of the 
atom is thus of the form 


€=e+&4+6&. (61) 


If the solid is in equilibrium at an absolute temperature T which is 
high enough so that the approximation of classical statistical mechanics 
is valid, the equipartition theorem is immediately applicable to each 
of the quadratic terms of (59). The mean value of ες is thus simply 


ἐς; = $kT + 317 = ΚΤ. (62) 
Similarly ἐν = @ = kT. By (61), the mean energy of an atom is then 
€ = 3kT. 


The mean energy of one mole of the solid, containing Avogadro's 
number N, of atoms, is thus simply 


E = 3N,kT = 3RT (63) 


where R = N,k is the gas constant. In accordance with (5.23), the 
molar specific heat cy at constant volume of the solid is then given by 


Pw 


Table 6.1 Values of the molar specific heat _, 
cy at constant pressure and the molar specific 
heat cy at constant volume for some simple 
solids at a temperature T= 298°K. The 
values of cy were deduced from the directly 
measured values of c, by applying some small 
corrections. All values are expressed in units 
of joules deg? mole~?, [Data taken from 
Dwight E. Gray (ed.), American Institute of 
Physics Handbook, 2d ed., p. 4-48 (McGraw- 
Hill Book Company, New York, 1963).] 
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or= (3), 


: es 
Using the numerical value (5.4) of R, this becomest 
cy & 25 joules mole~! deg™}. (65) 


Note the extreme generality of the result (64). It is completely in- 
dependent of the atomic mass or of the magnitude of the spring con- 
stant a. If the solid contained atoms having different masses or having 
different spring constants, the result (64) would thus still remain valid. 
If the solid were not isotropic, the restoring force on an atom would 
have a different magnitude along different directions and the spring 
constants for the x, y, and z directions would also differ correspond- 
ingly; but the value of the mean energy per atom would still be 3kT 
and (64) would still be valid. Indeed, the rigorous analysis of the 
simultaneous vibration of all the atoms in the solid shows that a descrip- 
tion in terms of individual motions of single atoms is inadequate and 
that simple harmonic motion is really performed by atoms moving to- 
gether in groups of different sizes.} But since (64) is independent of 
the masses and spring constants, it still remains valid. The only limita- 
tion on (64) is thus that the temperature be sufficiently high so that the 
classical approximation is valid. Hence (64) asserts that: 


At sufficiently high temperatures, all solids have the same 


temperature-independent molar specific heat cy equal 
to 3R. 


(66) 


We shall see presently that in the case of most solids (diamond being 
a conspicuous exception) room temperature is sufficiently high for the 
approximate validity of the classical discussion. ] 
Historically, the validity of statement (66) was first discovered | 
empirically and is known as the law of Dulong and Petit. Table 6.1 
lists directly measured values of the molar specific heat c, (at constant | 
pressure) for some solids at room temperature. The values of the molar | 


Ξ specific heat cy (at constant volume) can be obtained from the corre- 


+ In terms of calories, cy = 6 cal mole“! deg"? 
{That is, by the normal modes of the solid. 
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sponding values of c, after applying some small corrections.+ It is 
seen that the values of cy listed in the table are, on the whole, in rather 
good agreement with the value (65) predicted by the classical theory. 
Bad discrepancies occur, however, in the cases of silicon and, particu- 
larly, diamond. The reason is that quantum effects are still important 
for these solids even at a temperature as high as 300°K. 


Validity of the classical approximation 

Let us then ask under what circumstances the preceding classical dis- 
cussion is expected to be valid. The criterion is again provided by the 
condition (3). Consider thus a vibrating atom which has an energy ἐς 
associated with its motion in the x direction. By virtue of the equiparti- 
tion theorem applied to (59), the momentum p, of the atom is then 
such that 


The momentum of an atom thus has a typical magnitude po of the 
order of 


po = Vp2 = VmkT. (67) 


To assure the validity of the classical description, quantum effects 
should not prevent us from localizing an atom within a typical dis- 
tance so of the order of the mean amplitude of vibration of this atom. 
But the equipartition theorem applied to (59) yields 


tax? = 4kT. 
The typical magnitude so of the atomic displacement is thus of the 


order of 
50 = V2 = ΜΕ : (68) 


Hence the condition (3) that the Heisenberg uncertainty principle be 
of negligible significance becomes simply 


f It is quite easy to make specific heat measurements on solids at constant atmospheric 
Pressure, allowing the volume of the solid to change slightly, but very difficult to devise an 
experimental arrangement guaranteed to prevent the solid from expanding its volume when 
its temperature is raised. Since the volume of a solid changes only slightly as a result of a 
temperature change, the difference between c, and cy is, however, quite small. It can 
readily be calculated from a knowledge of the measured macroscopic properties of the 
Particular solid under consideration. 


"" 


- 


"0 


Fig. 6.15  Surface-view of a solid whose atoms 


are arranged in a simple cubic lattice. 
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sopo =P BA 


where w is, by (60), the typical (angular) frequency of oscillation of an 
atom in the solid. Equivalently the criterion (69) for the validity of 
the classical approximation can be written in the form 


or 


T>9, 


where Θ = 2 


(69) 


(70) 


is a temperature parameter characteristic of the solid under considera- 


tion. 


Numerical estimates 


The magnitude of the atomic frequency 
of oscillation ὦ can be estimated from 
the elastic properties of the solid under 
consideration. Suppose, for example, 
that a small pressure Ap is applied to a 
solidt; as 8 result, the volume V of the 
solid will decrease by some small amount 
AV. The quantity x defined by 


-Ὡ Bg 

a ΕΝ in (7) 
is called the compressibility of the solid 
(the minus sign being introduced to make 
x positive). It is readily measured and 
provides some information about the 

forces between the atoms in the solid. 
Let us then try to deduce from the 
compressibility x a rough estimate of the 
net force F acting on an atom when it is 
displaced from its equilibrium position in 
the solid. Imagine, for simplicity, that 
the atoms of the solid are located at the 
centers of cubes of edge length a so that 
the interatomic spacing is also equal to a. 
An excess pressure Ap applied to 8 sur- 
face of the solid corresponds then to 8 
force F = a? Ap applied tp the area αὐ 
occupied by a single atom (see Fig. 6.15). 
Furthermore, the fractional volume 


change of the solid under the influence 
of the excess pressure Ap is equal to the 
fractional volume change occupied per 
atom so that 

AV _ Ala) _ Sata _ 38a 

fo pe i ἊΝ 
Using the definition (71) of the compres: 4 
sibility, the force F on an atom is then ~ 
related to Aa by 


raeapno(-18) 


or Ἐπ παδὰ 


where the constant a, relating the force F 
to the displacement Δα of the atom from 
its equilibrium position, is given by 


— She 


(72) 


With our simple approximation of an g 
assumed cubic lattice of atoms in the 
solid, the estimated frequency of vibra- 
tion (60) of an atom in the solid should 
then be approximately 


ω- νῈΞ J= (73) 


+ No confusion should arise between the symbol p used for pressure and the same symbol 


used elsewhere to denote momentum. 


ws 


τω 


Sec. 6.7 


To get an appreciation for the relevant 
magnitudes, let us estimate w in the case 
of copper. The measured parameters of 
this metal aret: % 


atomic weight 


n= 635, 
density 

p = 8.95 gm cm-3, 
compressibility 


κα 7.3 Χ 10719 cm? dyne“}, 


From these numbers we find for the 
atomic mass 


ae tia. Se 
“Na 6.02 % 1083 
= 1.05 x 10-2 gm. 


Since p = m/a, the interatomic distance 
is equal to 

2." ΕἸ = x i 
ρ % 8.95 

= 2.34 x 10- em, 


Hence (73) gives for the angular fre- 
quency of vibration 


ἫΝ [ 3(23.34 x 10-8) "Ἢ 
~ U(7.3 x 10-75)(1.05 x 10-22) 
= 3.02 x 1019 radians/sec, 


or for the corresponding frequency 
» = 2 = 48 x 1015 cycles/see. (74) 
π 
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This is 8 frequency lying in the infrared 
region of the electromagnetic spectrum. 

The characteristic temperature Θ de- 
fined in (70) is then equal to 


ho _ (1.054 x 10-27)(3.02 χ 1013) 
et (1.38 x 10716) 
= 230°K. (75) 


Θ 


Hence the classical result ον τ-- 38 
should be valid for copper if T>> 230°K, 
i.e., it should begin to be reasonably 
accurate for temperatures of the order of 
room temperature or above. 

On the other hand, let us consider a 
solid such as diamond. The atomic 
mass of one of its carbon atoms is 12, 
i.e., about 5 times less than that of a cop- 
per atom. In addition, diamond is a 
very hard solid so that its compressibility 
is very low, about 3 times less than that 
of copper (x = 2.26 x 10-13. cm? dyne-1), 
Thus the frequency of vibration w of a 
carbon atom in diamond is, by (73), 
much higher than that of a copper atom 
in copper metal. More precisely, for 
diamond (density p = 3.52 gm cm-3) the 
estimated temperature parameter Θ ~ 
830°K. The classical approximation is, 
therefore, not expected to be applicable 
to diamond at room temperature and the 
low value of cy for diamond in Table 6.1 
is by no means surprising. 


It is clear that the classical result cy = 3R must break down at low 
temperatures where the condition (69) is not satisfied. Indeed, the 
very general result (5.32) implies that, as the temperature is reduced 
below the range of validity of (69), the specific heat cy must ultimately 
decrease so as to approach zero as T->0. Any correct quantum- 
mechanical calculation must yield this limiting result. If every atom 
in the solid is assumed to vibrate with the same frequency «, the 
quantum-mechanical calculation of the specific heat cy can be carried 
out quite easily to give an approximate expression for cy valid for all 
temperatures. Details will be left as an exercise in Prob. 6.21. 


1 The data are taken from Dwight E. Gray (ed.), American Institute of Physics Hand- 
book, 2d ed. (McGraw-Hill Book Company, New York, 1963). 
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Summary of Definitions 
phase space A cartesian multidimensional space whose axes are labeled by all the coor- 
dinates and momenta describing a system in classical mechanics. A point in this space 
specifies all the coordinates and momenta of the system. 
Maxwell velocity distribution ‘The expression - 
flv) ὧν α ε΄ αὐῆνενε dy Ἂ 


giving the mean number of molecules having a velocity between v and v + dv in ἃ gas 
at the absolute temperature T. It is merely a special case of the canonical distribution. 

effusion The outflow of molecules from a container through a small hole whose size is 
much smaller than the molecular mean free path. 


Important Relations 


If a system described classically is in equilibrium at the absolute temperature T, 
every independent quadratic term εἰ of its energy has a mean value 


& = 47. @ 


Suggestions for Supplementary Reading 

O. R. Frisch, “Molecular Beams,” Sci. American 212, 58 (May 1965). A good discussion 
of the wide range of fundamental physical experiments made possible by molecular beam 
investigations. 

F, Reif, Fundamentals of Statistical and Thermal Physics, chap. 7 (McGraw-Hill Book 
‘Company, New York, 1965). A somewhat more detailed discussion of the topics of the 
present chapter. 

F.W. Sears, An Introduction to Thermodynamics, the Kinetic Theory of Gases, and Statisti- 
cal Mechanics, 2d ed., chaps. 11 and 12 (Addison-Wesley Publishing Company, Inc., 
Reading, Mass., 1953). : 

D. K. C. MacDonald, Faraday, Maxwell, and Kelvin (Anchor Books, Doubleday ὃς Com- 
pany, Inc., Garden City, N.Y., 1964). This book contains a short account of Maxwell's 
life and scientific work. ᾿ 
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Problems 


6.1 Phase space of a classical harmonic oscillator 
The energy of a one-dimensional harmonic oscillator, whose position 
coordinate is x and whose momentum is p, is given by 
i τ ΕΞ τιν᾽ + fax? 
where the first term on the right is its kinetic and the second term its potential 
energy. Here πὶ denotes the mass of the oscillating particle and a the spring 
constant of the restoring force acting on the particle. 

Consider an ensemble of such oscillators, the energy of each oscillator being 
known to lie between E and E + δὲ. Treating the situation classically, indicate 
in the two-dimensional ap phase space the region of states accessible to the 
oscillator. 


6.2 Ideal gas in a gravitational field 

An ideal gas at the absolute temperature T is in equilibrium in the pres- 
ence of a gravitational field described by an acceleration g in the downward 
(or —2) direction. The mass of each molecule is m. 


(a) Use the canonical distribution in its classical form to find the probability 
P(r,p) dr d3p that a molecule has a position between r andr + dr and a momen- 
tum between p and p + dp. 


() Find (to within a trivial constant of proportionality) the probability 
Pv) dv that ἃ molecule has a velocity between v and v + dy, irrespective of 
its position in space. Compare this result with the corresponding probability in 
the absence of a gravitational field. 


(c) Find (to within a trivial constant of proportionality) the probability 
P"(z) dz that a molecule is located at a height between z and z + dz, irrespective 
of its velocity or its location in any horizontal plane. 


63 Macroscopic discussion of an ideal gas in a gravitational field 

Consider the ideal gas of the last problem from a completely macroscopic 
Point of view. By writing down the condition of mechanical equilibrium for a 
slice of the gas located between the heights z and z + dz, and by using the equa- 
tion of state (4.92), derive an expression for n(z), the number of molecules per 
unit volume at a height z. Compare this with the result for P'(z) dz derived in 
the last problem on the basis of statistical mechanics. 
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6.4 Spatial distribution of electrons in a cylindrical electric field 

A wire of radius τῷ is coincident with the axis of a metal cylinder of 
radius R and length L. The wire is maintained at a positive potential of V stat- 
volts with respect to the cylinder. The whole system is at some high absolute 
temperature T. As a result, electrons emitted from the hot metals form a dilute 
gas filling the cylindrical container and in equilibrium with it. The density of 
these electrons is so low that their mutual electrostatic interaction can be 
neglected. 


(a) Use Gauss’s theorem to obtain an expression for the electrostatic field 

~ which exists at points at a radial distance r from the wire (το « τ « R). The 

cylinder of length L may be assumed to be very long so that end effects are 
negligible. 


(b) In thermal equilibrium, the electrons form a gas of variable density which 
fills the entire space between the wire and cylinder. Using the result of part (a), 
find how the number n of electrons per unit volume depends on the radial dis- 
tance r. 


(c) Give an approximate criterion determining how low the temperature T 
must be so that the electron density is small enough to justify the approximation 
of neglecting the mutual electrostatic interaction between the electrons. 


6.5 Determination of large molecular weights by the ultracentrifuge 
Consider a macromolecule (i.e., a very large molecule with a molecular 

weight of several millions) immersed in an incompressible fluid of density p at 
the absolute temperature T. The volume v occupied by one such molecule can 
be considered known since the volume occupied by a mole of macromolecules 
can be determined by volume measurements on a solution of macromolecules. 
A dilute solution of this type is now placed in an ultracentrifuge rotating with a 
high angular velocity ©. In the frame of reference rotating with the centrifuge, 
any particle of mass m at rest with respect to this frame is then acted upon by 
an outward centrifugal force mw*r, where r denotes the distance of the particle 
from the axis of rotation. 

(a) What is the net force acting in this frame of reference on a macromolecule 
of mass m, if the buoyancy effect of the surrounding fluid is taken into account? 

(b) Suppose that equilibrium has been attained in this frame of reference so 
that the mean number n(r) dr (per unit volume) of macromolecules located at ἃ 
distance from the axis of rotation between r and r + dr is independent of time. 
Apply the canonical distribution to find (to within a constant of proportionality) 
the number n(r) dr as a function of σ᾿ 

(c) Measurements of the relative number n(r) of molecules as a function of r 


can be made by measuring the absorption of light by the solution. Show how 


such measurements can be used to deduce the mass m of a macromolecule. 
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*6.6 Spatial separation of magnetic atoms 
in an inhomogeneous magnetic field 
An aqueous solution at room temperature T contains a small concen- 
tration of magnetic atoms, each of which has a spin $ and magnetic moment Jip. 
The solution is placed in an external magnetic field. The magnitude of this 
field is inhomogeneous over the volume of the solution. To be specific, the 
= component B of this magnetic field is a uniformly increasing function of z, 
assuming a value B; at the bottom of the solution where z = = anda larger value 
By at the top of the solution where z = zo. 

(a) Let π, (5) dz denote the mean number of magnetic atoms whose magnetic 
moment points along the z direction and which are located between z and 
z+ dz. What is the ratio n,(z2)/n,(z)? 

(b) Let n(z) dz denote the total mean number of magnetic atoms (of both 
directions of spin orientation) located between z and z + dz. What is the ratio 
n(z2)/n(z1)? 15 it less than, equal to, or greater than unity? 

(c) Make use of the fact that μο8 < kT to simplify the answers to the preced- 
ing questions. 

(d) Estimate numerically the magnitude of the ratio n(z)/n(z:) at room tem- 
perature if μο ~ 1030 erg/gauss is of the order of a Bohr magneton, B; = 0, and 
B, = 5 x 109 gauss, 


6.7 Most probable energy of a molecule in a gas 

What is the most probable kinetic energy € of a molecule described by 
ἃ Maxwellian velocity distribution? Is it equal to m6, where 6 is the most 
probable speed of the molecule? 


6.8 Temperature dependence of effusion 
Molecules of a gas enclosed within a container effuse through a small hole 

into a surrounding vacuum. Suppose that the absolute temperature of the gas 
in the container is doubled while its pressure is kept constant. 

(2) By what factor does the number of molecules escaping per second through 
the hole change? 

(b) By what factor does the force change which is exerted on a vane suspended 
at some distance in front of the hole? 


6.9 Mean kinetic energy of an effusing molecule 

The molecules of a monatomic ideal gas are escaping by effusion through 
Ὁ small hole in a wall of an enclosure maintained at an absolute temperature T. 
By physical reasoning (without actual calculation) do you expect the mean kinetic 
©nergy € of a molecule in the effusing beam to be equal to, greater than, or less 
than the mean kinetic energy of a molecule inside the enclosure? 


Fig. 6.16 An effusing beam impinging upon _ 
a vane. 
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6.10 Pressure drop in gas container having a small leak 

‘A thin-walled vessel of volume V, kept at constant temperature Τὶ con- 
tains a gas which slowly leaks out through a small hole of area A. The outside 
pressure is low enough so that leakage back into the vessel is negligible. Estimate 
the time required for the pressure in the vessel to decrease to half of its original 
value. Express your answer in terms of A, V, and the mean molecular speed 0. 


6.11 Cryogenic (i.e., low-temperature) pumping 

Gases can be removed from a container by lowering the temperature 
of some of its walls, This is a method commonly used to achieve the good 
vacuums necessary for many physical experiments. To illustrate the principle 
of the method, consider a spherical bulb 10 cm in radius which is maintained at 
room temperature (300°K) except for a 1-cm? spot which is kept at liquid nitro- 
gen temperature (77 Κ)2, The bulb contains water vapor originally at a pressure 
of 0.1 mm of mercury. Assuming that every water molecule striking the cold area 
condenses and sticks to the surface, estimate the time required for the pressure to 
decrease to 10-6 mm of mercury. 


6.12 Separation of isotopes by effusion 

A vessel has porous walls containing very many tiny holes. Gas mole- 
cules can pass through these holes by effusion and then be pumped off to some 
collecting chamber. The vessel is filled with a dilute gas consisting of two kinds 
of molecules which have different masses m; and mg by virtue of the fact that 
they contain two different isotopes of the same atom. Let us denote by c; the 
concentration of the first type of molecules in the vessel, and by cz the concentra- 
tion of the second type. (The concentration οἱ is the ratio of the number of mole- 
cules of type i to the total number of molecules.) These concentrations can be 
kept constant in the vessel by providing a steady flow of fresh gas through it so 

- as to replenish any gas that has effused. 

(a) Let cy’ and co’ denote the concentrations of the two types of molecules in 
the collecting chamber. What is the ratio co’/cy’? 

(Ὁ) By using the gas UF, one can attempt to separate 3351 from 288U, the first 
of these isotopes being the one useful in the initiation of nuclear-fission reactions. 
The molecules in the vessel are then 288U!®Fg and 235U!®Fg. (The concentra- 
tions of these molecules, corresponding to the natural abundance of the two 
uranium isotopes, are cogs = 99.3 per cent and 6:85 = 0.7 per cent.) Calculate 
the corresponding ratio c’z35/c’s3s of the molecules collected after effusion. 
Express the result in terms of the original concentration ratio C235/Cc2ss- 


6.13 Change of concentration as a result of effusion 

A container has as one of its walls a membrane containing many small 
holes. If the container is filled with gas at some moderate pressure Ὁ, this gas 
will escape by effusion into the vacuum surrounding the container. It is found 
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that when the container is filled with helium gas at room temperature and at a 
pressure pi, the pressure will have fallen to 4p after one hour. 

Suppose that the container is filled at room temperature and at a total pres- 
sure Ὁ with a mixture of helium (He) and neon (Ne), the atomic concentrations 
of both gases being 50 per cent (i.e., 50 per cent of the atoms are He and 50 per 
cent of them are Ne). What will be the ratio nye/nue of the atomic concentra- 
tions of Ne to He after one hour? Express your answer in terms of the atomic 
weights fine of neon and μῃς of helium. 


6.14 Calculation of mean values for a molecule in a gas 

A gas of molecules, each having mass m, is at rest in thermal equilibrium 
at the absolute temperature T. Denote the velocity of a molecule by ν, its three 
cartesian components of velocity by Ὁ,, v), and v;, and its speed by v. Find the 
following mean values: 


(6) (v2 + bey)*, where Ὁ is a constant. 
(Suggestion: Symmetry arguments and the equipartition theorem should suffice 
to answer all these questions without any significant calculation.) 


6.15 Doppler broadening of spectral lines 

A gas of atoms, each of mass m, is maintained at the absolute tempera- 
ture T inside an enclosure. The atoms emit light which passes (in the x direction) 
through a window of the enclosure and can then be observed as a spectral line 
in a spectroscope. A stationary atom would emit light at the sharply defined 
frequency . But, because of the Doppler effect, the frequency of the light 
observed from an atom having an x component of velocity Ὁ, is not simply equal 
to the frequency vo, but is given approximately by 


ve νο(! +4) 


where cis the velocity of light. Asa result, not all of the light arriving at the spec- 
troscope is at the frequency yo; instead it is characterized by some intensity dis- 
tribution I(v) dy giving the fraction of light intensity lying in the frequency range 
between ν and ν + dv. 

(a) Calculate the mean frequency 7-of the light observed in the spectroscope. 

(Ὁ) Calculate the dispersion (A>)? = (vy — ¥)? in the frequency of the light 
observed in the spectroscope, 

(c) Show how measurements of the width Av = [(Ay)2]12 of a spectral line 
observed in the light coming from a star allow one to determine the temperature 
of that star, 
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6.16 Specific heat of an adsorbed mobile monolayer 

If the surface of some solid is maintained in a reasonably good vacuum, 
a single layer of molecules, one molecular diameter thick, can form on this sur- 
face. (The molecules are then said to be adsorbed on the surface.) The mole- 
cules are held to this surface by forces exerted on them by the atoms of the solid, 
but they may be quite free to move in two dimensions on this surface. They 
form then, to good approximation, a classical two-dimensional gas. If the mole- 
cules are monatomic and the absolute temperature is T, what is the specific heat 
per mole of molecules thus adsorbed on a surface of fixed size? 


6.17 Temperature dependence of the electrical resistivity of a metal 

The electrical resistivity p of a metal is proportional to the probability 
that an electron is scattered by the vibrating atoms in the lattice, and this prob- 
ability is in tum proportional to the mean square amplitude of vibration of these 
atoms. How does the electrical resistivity p of the metal depend on its absolute 
temperature in the range near room temperature, or above, where classical sta- 
tistical mechanics can validly be applied to discuss the vibrations of the atoms 
in the metal? 


6.18 Theoretical limiting accuracy of a weight measurement 
A very sensitive spring balance consists of a quartz spring suspended 

from a fixed support. The spring constant is a, i.e., the restoring force of the 
spring is —ax if the spring is stretched by an amount x. The balance is at 
an absolute temperature T in a location where the acceleration due to gravity is g. 

(a) If a very small object of mass M is suspended from the spring, what is the 
mean resultant elongation X of the spring? 

(b) What is the magnitude (Ax? = (x — 3) of the thermal fluctuations of the 
object about its equilibrium position? 

(c) It becomes impracticable to measure the mass of an object when the 
fluctuations are so large that [{Δ1]2]} 2 > % Whatis the minimum mass M which 
can be measured with this balance? 


6.19 Specific heat of anharmonic oscillators 

Consider a one-dimensional osciliator (not simple harmonic) which is 
described by a position coordinate x and by a momentum p and whose energy 
is given by 


c= fi ταὶ @ 
ς ne 

where the first term on the right is its kinetic energy and the second term is its 
potential energy. Here m denotes the mass of the oscillator and b is some con- 
stant. Suppose that this oscillator is in thermal equilibrium with a heat reservoir 
at a temperature T high enough so that the approximation of classical mechanics 
is a good one. 
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(a) What is the mean kinetic energy of this oscillator? 

(b) What is its mean potential energy? 

(c) What is its mean total energy? 

(ἀν) Consider an assembly of weakly interacting particles, each vibrating in 
one dimension so that its energy is given by (i). What is the specific heat at con- 
stant volume per mole of these particles? 


(Suggestion: There is no need to evaluate explicitly any integral to answer 
these questions.) = 


6.20 Specific heat of a highly anisotropic solid 

Consider a solid which has a highly anisotropic crystalline layer struc- 
ture. Each atom in this structure can be regarded as performing simple harmonic 
oscillations in three dimensions. The restoring forces in directions parallel to 
a layer are very large; hence the natural frequencies of oscillations in the x and 
y directions lying within the plane of a layer are both equal to a value ὧν which 
is so large that hw; 15» 300k, the thermal energy kT at room temperature. On the 
other hand. the restoring force perpendicular to a layer is quite small; hence the 
frequency of oscillation ὦ, of an atom in the z direction perpendicular to a layer 
is so smal] that hw, < 300k. On the basis of this model, what is the molar 
specific heat (at constant volume) of this solid at 300°K? 


621 Quantum theory of the specific heat of solids 
To treat the atomic vibrations in a solid by quantum mechanics, use as 
a simplifying approximation a model which assumes that each atom of the solid 
vibrates independently of the other atoms with the same angular frequency w in 
each of its three directions. The solid consisting of N atoms is then equivalent 
to an assembly of 3N independent one-dimensional oscillators vibrating with 
the frequency . The possible quantum states of every such oscillator have 
discrete energies given by 
ἐν = (n+ dhe iC) 
where the quantum number n can assume the possible values n = 0,1.2,3,.... 
(a) Suppose that the solid is in equilibrium at the absolute temperature T. By 
using the energy levels (i) and the canonical distribution, proceed as in Prob. 4.22 
to calculate the mean energy ¢ of an oscillator and thus also the total mean energy 
E = Né of the vibrating atoms in the solid. 


(b) Using the result of part (a), proceed as in Prob. 5.20 to calculate the molar 
Specific heat cy of the solid. 


264 Canonical Distribution in the Classical Approximation 


(c) Show that the result of part (b) can be expressed in the form 


= wer . 
cy=3R er - ὩΣ (ii) 

—tw_@ - 

where w=ar = T (iii) 


and where © = tw/k is the temperature parameter previously defined in (70). 

(4) Show that, when ΤῊ» Θ, the result (ii) approaches properly the classical 
value cy = 3R. 

(e) Show that the expression (ii) for cy approaches properly the value zero as 
T>0. 

(f) Find an approximate expression for the result (ii) in the limit when T< 0. 

(6) Make a rough sketch of ον 85 ἃ function of the absolute temperature T. 

(h) Apply the criterion (1) to find below what temperature the classical 
approximation is not expected to be applicable. Compare your result with the 
condition (69) for the applicability of the classical theory of specific heats. 

[Using the approximations made in this problem, Einstein first derived the 
expression (ii) in 1907. Using the novel quantum ideas, he was thus able to 
account for the experimentally observed specific heat behavior which had been 
inexplicable on the basis of the classical theory.] 


Appendix 


A.l Gaussian Distribution 
Consider the binomial distribution derived in (2.14), 


Pn) = oe Me (1) 


where q = 1 — p. When Nis large, the calculation of the probability 
P(n) appears difficult since it requires the computation of the factorials 
of large numbers. It then becomes possible, however, to use approxi- 
mations which allow us to transform the expression (1) into a particu- 
larly simple form. 

The simplifying feature, already pointed out in Sec. 2.3, is that the 
probability P(n) tends to exhibit a maximum which is quite pronounced 
when N is large. The probability P(n) thus becomes negligibly small 
whenever n differs appreciably from the particular value n = ἢ where 
Pis maximum. Hence the region ordinarily of interest, that where the 
probability P(n) is not negligible, consists only of those values of n 
which do not differ very much from ἢ. But in this relatively small re- 
gion an approximate expression for P(n) can readily be found. This 
expression can then be used for all values of n where the probability P 
is not negligibly small, i-e., in the entire domain where a knowledge of 
P is ordinarily of interest. 

It is thus sufficient to investigate the behavior of P(n) near the posi- 
tion ἢ of its maximum. Note first that, unless p = 0 or q = 0, this 
value fi is neither very close to 0 nor to N; thus ἢ itself is also a large 
number when Nis large. The numbers n in the region of interest near 
fare thus also large. But when n is large, P(n) changes relatively little 
when n changes by unity; ie., 

|P(n + 1) — P(n)| < Pin) 

so that P is a slowly varying function of n. It is, therefore, a good 
approximation to regard P as a smooth function of a continuous vari- 
able n, although only integral values of n are of physical relevance. A 
second useful observation, concerns the fact that the logarithm of P is 
a much more slowly varying function of n than P itself. Instead of 
dealing directly with P, it is thus easier to investigate the behavior of 
In P and to find for In P a good approximation valid in a large domain 
of the variable n. : 
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Taking the logarithm of (1), we thus obtain 
In P= InN! — Inn! —In(V—n)! Ὁ ninp+(N—n)ing. (2) 


The particular value n = ἢ where P has its maximum is then deter- 
mined by the condition 


ar 
uae 0 
or, equivalently, by the condition that In P is maximum, 
dinP τὰρ΄ 
ane Pa =e 8) 


To differentiate the expression (2), we note that all the numbers 
occurring as factorials are large compared to unity. We can, therefore, 
apply to each of these the approximation (M.7) which asserts that for 
any number m, large enough so that m > 1, 


din ml ~ tn τι, 4) 


Differentiation of (2) with respect to n then yields to good approxi- 
mation 


OB? = —inn + n(V—n) +Inp—Ing. 6) 


To find the maximum of P we equate the expression (5) to zero in 
accordance with (3). Thus 


[=P]. 


or W=n)p ἘΞ ἢ 
Ῥ Ὁ 

Hence (N — n)p = nq 

or Np = n(p + q). 


Since p + q = 1, the value n = ii where P has its maximum is then 
given by 
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To investigate the behavior of In P near its maximum, we need only 
expand it in a Taylor series about the value ἢ. Thus we can write 


dine fo... 
+ alae με @ 
where ySn-ii (8) 


and where the square brackets indicate that the derivatives are to be 
evaluated atn = fi. The first derivative vanishes since we are expand- 
ing about a maximum where (3) is satisfied. The other derivatives can 
be found by successive differentiation of (5). In particular, 


@inP_ 1 1 N 


dn? n N—-n n(N — πη) 


Evaluating this derivative where n= ἢ, ie., where n= Np and 
Ν-- π τ ΝᾺ — p) = Nq, we find 


Saal miele 
dn? |” Νρη᾽ 
Hence (7) becomes 


In P(n) = ln Pa) — 5h + 


or P(n) = Pe-v2/2Npq ... = Pe-n—W?/2Npa ... (9) 


where we have written P= P(n). 

Note that the probability P(n) in (9) becomes negligible compared 
to its maximum value P when y becomes so large that y?/(Npq) > 1 
or |y| > (Npq), since the exponential factor is then very much 
smaller than unity. The probability P(n) is thus only appreciable in 
the domain where |y| < (Npq)!/*. But there y is ordinarily sufficiently 
small so that the terms in (7) which involve y? and higher powers of y 
are negligible compared to the leading term, involving y?, which we 
retained.t Hence we canonclude that (9) is indeed a good approxi- 
mation to the probability P(n) in the entire region where this proba- 
bility has appreciable magnitude. 


} This statement is true to the extent that (Npq)’?> 1. See Prob. P. 3. 
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The value of the constant Pin (9) can be expressed directly in terms 
of p and q by using the normalization condition 


DSP) =1 (10) 


where the summation is over all possible values of n. Since P(n) 
changes only slightly between the successive integral values of n, this 
sum can be replaced by an integration, A range of n of magnitude dn 
(much larger than unity) contains dn possible values of P(n). Hence 
the condition (10) becomes 


[Ῥω dn = Γερο an 


= Pf” e-v2Nm dy = 1, (11) 


Here we have made the simplification of extending the range of inte- 
gration from —oo to +00. This is an excellent approximation since 
P(n) is negligibly small anyhow whenever |n — ἢ] becomes sufficiently 
large. Using the relation (M.23), the integral in (11) yields simply 


P\/2aNpq ἐς ἃ, 
Thus Pos (12) 


Using this result and the value ἢ = Np given by (6), the expression (9) 
for the probability P(n) then becomes 


P(n) = —4 ___¢-n-npp2/2Np0, (13) 


V 2aNpq 


Note that this expression is much easier to evaluate than (1) since it 
does not require the calculation of any factorials. 

A probability of the functional form given by the right side of (9) or 
(13) is known as a Gaussian distribution. The essential argument 
leading to this distribution, i.e., a power series expansion of a logarithm, 
is widely applicable. It is therefore not surprising that Gaussian dis- 
tributions occur very frequently in statistical arguments whenever the 
numbers under consideration are large. 


An An 


Fig. A.l Gaussian distribution showing as a 
smooth curve the values of the probability 
P(n) as a function of n. The probability W(x) 
that n assumes a value between ἢ — x and 
ii + x is given by the area lying in this range 
below the curve. If An denotes the standard 
deviation of n, computation shows that 


W(An) = 0.683, W(2An) = 0.954, 
and W(3An) = 0.997. 
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The expression (13) for P(n) can be 
readily used to calculate various mean 
values of n. The calculation of sums can 
be reduced to an evaluation of equivalent 
integrals in the same manner as that 
used in calculating the normalization 
condition (10) by means of (11). Thus 
‘one finds 


A=> Pin 


Here the first integral is the same as that 
occurring in (11) and has the value 
(2Npgq)/?. The second integral van- 
ishes by symmetry since its integrand is 
odd (i.e., is of opposite sign for +y and 
—y) so that contributions to the integral 
near +y and —y cancel each other. 
Thus we are left simply with 


4) 


This shows that the mean value of n is 
equal to the value i= Np where the 
probability P is maximum. 

Similarly one finds for the dispersion 
of n the result 


(Gap = (n— A? = δ᾽ Pin)(n — Np)? = (22Npq)” 


= @aNpgy12{__ ev 2 oaye dy. 


Using (M.26), this integral yields 


Gin} = Νρᾳ. as) 
The standard deviation of n is thent 
An = VNpq. as) 


The Gaussian distribution (13) can 
thus be expressed solely in terms of the 
two parameters fi and An of (14) and (16) 


re) = he [-3 (CR) 
a7 


(@aNpq)-¥2f_e-ernvr 28a ndn = (2=Npq) afer 2xva (ἢ + y) dy 


ει = 
παν) ev /2¥ dy + (2aNpq)-22[e-¥8/280y dy, 


fT eo-nowate (x — Np dn 


Introducing the variable 


sothat on =n+med 
an 


Eq. (17) can also be expressed more com- | 
pactly as = 


P(n) An = 


symmetric about its mean value, 
P(r) has the same value for z and 


+ Note that (14) and (15) agree properly with the results (2.66) and (2.67) derived in the 
text under the most general conditions of arbitrary magnitude of N. 
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A.2 Poisson Distribution 
Consider again the binomial distribution derived in (2.14), 


Pin) = waar — py (18) 


In the preceding section we showed that, when N> 1, the expression 
(18) can be approximated by a Gaussian distribution valid in the entire 
domain where the probability P(n) is of appreciable magnitude (i.e., 
where it is not too far removed from the region of its maximum). We 
shall now examine an approximation of (18) valid in a different domain. 
This approximation becomes relevant when the probability p is suffi- 
ciently small so that 


p< (19) 
and when the number n of interest is sufficiently small so that 
n<n. (20) 


In contrast to the situation envisaged in discussing the Gaussian ap- 
proximation, the number n may here be arbitrarily small. 

Let us now examine the approximations made possible by the con- 
ditions (19) and (20). We note first that 


Tg N= λον τ 8). =n 


Since n << N, each of the n factors on the right is essentially equal to 
N. Hence we obtain the approximate result 


N! 
Wn) 


=N*. (21) 
We examine next the factor 
y=(1— py 
or, equivalently, its logarithm 
Iny = (V—n)In(1 — p). 


Since n< N, we can put N—n2=N. Furthermore, the condition 
P <1 allows us to approximate the logarithm by the first term in a 
Taylor expansion, i.e., to put In (1 — p) = —p. 
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Hence Iny = —Np 

or y=(1— py 5": εν». (22) 
Using the approximations (21) and (22) in the expression (18), we then 
obtain 

P(n) = Nt pnw 
ἈΝ. τι 

ΟΥ̓ P(n) = are - (23) 
where A=Np. (24) 


In terms of this definition of A, the condition (19) is equivalent to 


A<N. (25) 


The result (23) is called the Poisson distribution. Note that the 
factor πὶ in the denominator causes P(n) to decrease very rapidly when 
n becomes sufficiently large. Indeed, when A < 1, A” itself is a de- 
creasing function of n so that P(n) decreases monotonically as a func- 
tion of n. When A > 1, A” is an increasing function of n so that the 
factor A”/n!, and hence also P(n), tends to exhibit a maximum near 
Fig. A.2 The Poisson distribution P(n) of (23) n= ἃ before falling off for larger values of n.¢ In any event, when 
as 2 function of n. The two cases shown cor- >A, the probability P(n) becomes negligibly small. In the entire” 
respond to amean value= AofA=tand — region where n < A so that P(n) is not negligible, the condition (25) 
A=2. 3 then implies that n < λ «( Ν. The requirement (20) used in deriving 
the Poisson distribution is thus automatically satisfied wherever the 
probability P(n) is of appreciable magnitude. 

The parameter A defined in (24) is, by virtue of (2.66), equal to the” 
mean value i of n. Thus 3 


λ-π. (26) 


Note incidentally that, for a given value of or fi, the condition (25) 
or (20), requiring that p< 1, becomes increasingly well satisfied as 
Ν-» oo. The Poisson distribution thus becomes always applicable in 


this limit. ‘| 
| 

} When Nislarge and \ > 1, the Poisson distribution (23) reduces properly to a Gaussian 
distribution for values of n not too far removed from A. | 


ἢ 
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Explicit verification that A = 7 


The result (26) follows also directly from then obtain, puttingk =n—1, 


the Poisson distribution (23). Using the ee: τ 
definition of the mean value we have πεν τὶ). ἐλ ς aes να 
mat ο-ν ko 
» 
= ans 
a = 70m = eo Paes =OrS Berne 


kao 
No significant error results if this sum is 


extended to infinity since P(n) becomes 
negligibly small when n is large. Noting 
that the term with n= Ὁ vanishes, we 


since the last sum is simply the series 
expansion of the exponential function. 


Thus =r. (27) 


A.3 Magnitude of Energy Fluctuations 
Consider two macroscopic systems A and A’ in thermal interaction 
with each other. We shall use the notation of Sec. 4.1 and examine 
more closely the probability P(E) that A has an energy between E and 
E + δῈ. In particular, we should like to investigate the behavior of 
P(E) near the energy E = E where it is maximum. 

For this purpose we examine the slowly varying logarithm of P(E) 
given by (4.6), 


In P(E) = nC + In QE) + M QE), (28) 


and expand it in a Taylor series about the value E. Introducing the 
energy difference 


«=E-E, (29) 
the Taylor series for In 2(E) becomes 
In Q(E) = In QE) + [3.1 + Swe le. (30) 


Here the square brackets indicate that the derivatives are all evaluated 
for E= E. Terms involving powers of ε greater than «2 have been 
neglected. By introducing the abbreviations 


r= [282] ὃ 


i 88] ςὼ 
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Eq. (30) can be written in the simple form 
In Q(E) = In Q(E) + Be — Fre. (33) 


The minus sign has been introduced in the definition (32) for the sake 
of convenience so that the parameter y will turn out to be positive [in 
accordance with (4.32)]. 

A similar Taylor series can readily be written down for In Q'(E’), 
where Ε΄ = E* — E. Expanding about the value E'=E* — E, we 
have 


Ei — Ε΄ ----ἰ -- 8) = -« 
Hence we obtain, analogously to (30), 
In QE’) = InQ’(E’) + βί-- -- ἐγ --Ε (34) 
» — [ain Q 
where ΒΞ [ aE | 
— 0? In Q’] _ op’ 
and γΞ- -[555]- - [5] 


are defined analogously to (31) and (32) in terms of the derivatives 
evaluated at Ε΄ = Κ΄. Adding (33) and (34), we then obtain 


In {Q(E)Q(E)} = In {(E)O"(E')} + (B — Be — Hy + YE. (35) 


At the value E = E where P(E) = CQ(E)2’(E’) is maximum, it follows 
by (4.8) that β = ’; hence the term linear in € vanishes, as it should. 
Thus (28) can be written in the form 


In P(E) = In P(E) — ἐγοεῖ 


or P(E) = Ρ(Εγε αὐ ϑϑνοι (36) 


wheret γὙοξξγ τ Υ΄. (37) 


The result (36) shows that the value of yo must be positive to guar- 
antee that the probability P(£) has a maximum (rather than a minimum) 
at E= E. Indeed, it shows explicitly that P(E) becomes negligibly 
small compared to its maximum value when $yo(E — E)? > 1. ie., 
when |E — E| >> γο 142. In other words, it is very improbable that 


+ Note that our whole argument has been similar to that used in Appendix A.1 and that 
(36) is indeed a Gaussian distribution. 
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the energy of A lies far outside the range E + AE, wheret 
espe Ὁ: (38) 


The order of magnitude of AE can readily be estimated by using the 
definition (32) of y and the approximate expression (3.38) for Q(E). 
Thus we can write for any ordinary system with a ground state en- 
ergy Eo 

InQ ~ f(E — Eo) + constant. 


Hence we obtain by the definitions (31) and (32), evaluated for 
E=E=E, 


and v= -[4]-g4ee~F (39) 


‘This last relation shows explicitly that y is positive. It also shows that, 
for the given value of β at which the systems are in equilibrium with 
each other, the smaller system (i.e., the one with the smaller number of 
degrees of freedom) has the larger value of y. In (37) the magnitude 
of Yo is thus predominantly determined by the smaller of the two sys- 
tems. Suppose, for example, that A is much smaller than A’ so that 
y>>Y and yo = γι Then it follows from (38) and (39) that 


_E-f 


AE VF 


(40) 


Since f is a very large number in the case of a macroscopic system, 
Eq. (40) shows that the relative magnitude AE/(E — Eo) of energy 
fluctuations is very small. This result is discussed in greater detail in 
Sec. 4.1 where Eq. (4.10) is based on (40). 


# Since (36) depends only on the absolute value | £ — ΕἸ and is thus symmetric about 
the value E, the mean value of the energy must be equal to E, ie., Ε = E. This result is 
simply that already obtained in Eq. (A.17) for any Gaussian distribution. Similarly, it fol- 
lows from (A.17) that AE in (38) is equal to the standard deviation of the energy E. 


0 2 4 6 8 10 12 14 16 m 
Fig, M.1 Behavior of In m as a function of m. 
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Μ.4 Value of In πὶ for large n 

Since the calculation of n! becomes very laborious if n is large, we 
should like to find a simple approximation useful for computing n! in 
this case. By its definition 


mb=1xX2xX3xX---x(n-)I Χ π. 


n 
Hence Inn! =Inl+n2+4---+Ilnn= δ᾽ Inm. (8) 
m=1 

If n is large, all terms in the sum (8) (except the first few which are 
also the smallest) correspond to values of m large enough so that In m 
varies only slightly when m is increased by unity. The sum (8) (given 
by the area under the rectangles of Fig, M.1) can then be approximated 
with little error by an integral (giving the area under the continuous 
curve of Fig. M.1). With this approximation (8) becomes 


Inn = f" nxdx = [xinx — xf. (9) 
Hence, 


ifn>1, (10) 


since the contribution of the lower limit in (9) is then negligible. 
A better approximation, which is good within less than one per cent 
error in n! even if n is as small as 10, is given by Stirling's formula 


Inn! = πίη π -- π + $1n (2zn). (11) 
When n is quite large, n > Inn so that Stirling’s formula reduces to 


the simpler form (10). 
Note also that (10) implies the result 


dion! =inntn(t)-1=inn 
in n 


which agrees with (7). 
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Μ.δ The Inequality nx <x—1 
We wish to compare In x with x itself for-positive values of x. Consider 
the difference function 


f@=x-—Inx. (12) 
Asx—0, Inx > —00; hence f(x) > oo. 13 
As x > 00, Inzs<x; eas ΣΙ (13) 


To investigate the behavior of f(x) between these limits, we note that 
4 -ἰ 1.0 forz = 1: (14) 


Since f(x) is a continuous function of x satisfying (13) and having a 
single extremum at x = 1, it follows that f (x) must have the appearance 
shown in Fig. M.2 with a minimum at x = 1. Hence 


fe) 2f0Q) (5 signifx = 1), 
or by (12), 
x—Inx>1, 


Thus Inx<x-1 (= sign if x = 1). (15) 


M.6 Evaluation of the Integral Ve 6.5" dx 


The indefinite integral fe-** dx cannot be evaluated in terms of ele- 
mentary functions. Let I denote the desired definite integral 


1= [στ ας (16) 
This integral can be evaluated by exploiting the properties of the expo- 


nential function. Thus we can equally well write (16) in terms of a 
different variable of integration, i.e., 


1ΞΞ: [7 er dy. (17) 


1) ae eee 


Fig. M.2 The function f(x) =x —Inx asa 
function of x. 


Fig. M.3_ The polar coordinates r and φ used 
for evaluating the integral (18). 
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Multiplication of (16) and (17) then yields 
P= ie es? de [ew dy 
alr As ne tev dx dy 
or R= ἼΩΝ [σοῖο dx dy. (18) 
This double integral extends thus over the entire xy plane. 

Let us express the integration over this plane in terms of polar coor- 
dinates r and y. Then one has simply x? + y? = τῶ, and the element 
of area in these coordinates is given by (γ ἐγ ἀφ). In order to cover 
the entire plane, the variables p and r must range over the values 
0<9< 27 and0<r< oo. Hence (18) becomes 

P= iis fern dr de = anf “e-tr dr (19) 


since the integration over @ is immediate. But the factor r in this inte- 
grand makes the evaluation of this last integral trivial. Thus 


P = 2nf"(—4) de) = =e] Ξε το εξ): 
or I= yn. 


Since e~7* assumes the same value for x and -- , it also follows that 
fle dx = 2[ dx. 


Hence foes dx = }y7z. (21) 
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M.7 Evaluation of Integrals of the Form [ἰῷ dx 
Denote the desired integral by 
ho f erestan de (22) 
Putting x = a~1/2y, the integral becomes for n = 0 
= ΟΝ εν dy = Vega (23) 
where we have used the result (21). Similarly, 
= af ey dy = a —te-e | = ἐξα. (24) 


All other integrals where n is any integer such that n > 3 can then be 
calculated in terms of Io or I; by successive integrations by parts, 
Indeed, 


few de= fears 


ΣΝ ΣΎ ea 
= dew it [πιὰ 


Since the integrated term vanishes at both limits, we thus obtain 


(25) 


For example, 


ΓΞ Ὁ: VE gran, (26) 


